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vectors comprising these nucleic acids, as well as the production of FATP proteins in host cells are described. Also described are methods 
to test FATPs for fatty acid transport function, and methods to identify inhibitors or enhancers of transport function. The altering of LCFA 
uptake by administering to the mammal an inhibitor or enhancer of FATP transport function of a FATP in the small intestine can decrease 
or increase calories available as fats, and can decrease or increase circulating fatty acids. The organ specificity of FATP distribution can 
be exploited in methods to direct drugs* diagnostic indicators and so forth to an organ such as the heart. 
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FATTY ACID TRANSPORT PROTEINS 

RELATED APPLICATIONS 

This application claims the benefit of U.S. Provisional Application 
Number 60/071,374 entitled "Identification of a Family of Fatty Acid Transporters 
5 Conserved From Mycobacterium to Man," by Andreas Stahl, David Hirsch and 
Harvey F. Lodish, filed on January 15, 1998; U.S. Provisional Application Number 
60/093,491 entitled 'Tatty Acid Transport Proteins," by Andreas Stahl, David J. 
Hirsch, Harvey F. Lodish, Ruth E. Gimeno and Louis A. Tartaglia, filed on July 20, 
1998; and U.S. Provisional Application Number 60/1 10,941 entitled 'Tatty Acid 

10 Transport Proteins," by Andreas Stahl, David J. Hirsch, Harvey F. Lodish, Ruth E. 
Gimeno and Louis A. Tartaglia, filed on December 4, 1998. This application also 
claims priority to Attorney's Docket Nos. WHI97-21p3MA, WHI97-21p3MB, 
WHI97-21p3MC, WHI97-21p3MD, each of which is entitled "Fatty Acid Transport 
Proteins," by Andreas Stahl, David J. Hirsch, Harvey F. Lodish, Ruth E. Gimeno 

15 and Louis A. Tartaglia, filed on January 14, 1999. The teachings of each of these 
referenced applications are incorporated herein by reference in their entirety. 
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B ACKGROUND OF THE INVENTION 

Long chain fatty acids (LCFAs) are an important source of energy for most 
organisms. They also function as blood hormones, regulating key metabolic functions 
such as hepatic glucose production. Although LCFAs can diffuse through the 
5 hydrophobic core of the plasma membrane into cells, this nonspecific transport cannot 
account for the high affinity and specific transport of LCFAs exhibited by cells such as 
cardiac muscle, hepatocytes, enterocytes, and adipocytes. The molecular mechanisms 
of LCFA transport remains largely unknown. Identifying these mechanisms can lead to 
pharmaceuticals that modulate fatty acid uptake by the intestine and by other organs, 
10 thereby alleviating certain medical conditions (e.g. obesity). 

SUMMARY OF THE INVENTION 

Described herein is a diverse family of fatty acid transport proteins (FATPs) 
which are evolutionarily conserved; these FATPs are plasma membrane proteins which 
mediate transport of LCFAs across the membranes and into cells. Members of the 

1 5 FATP family described herein are present in a wide variety of organisms, from 

mycobacteria to humans, and exhibit very different expression patterns in tissues among 
the organisms. FATP family members are expressed in prokaryotic aid eukaryotic 
organisms and comprise characteristic amino acid domains or sequences which are 
highly conserved across family members. In addition, the function of the FATP gene 

2 0 family is conserved throughout evolution, as shown by the fact that the Caenorhabditis 
. (C). elegans and mycobacterial FATPs described herein facilitate LCFA uptake when 
they are overexpressed in COS cells or Escherichia (E.) coll respectively. FATPs are 
expressed in a wide variety of tissues, including all tissues which are important to fatty 
acid metabolism (uptake and processing). 

2 5 In specific embodiments, FATPs of the present invention are from such diverse 

organisms as humans (Homo (K) sapiens), mice, (Mus (M.) musculus), F. rubripes, C. 
elegans, Drosophila (D.) melanogaster, Saccharomyces (S.) cerevisiae, Aspergillus 
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nidulans t Cochliobolu heterostrophus, Magnaporthe grisea and Mycobacterium (M.) t 
such as M, tuberculosis. As described herein, four novel mouse FATPs, referred to as 
mmFATP2, mmFATP3 } rnmFATP4 and mmFATPS, and six human FATPs, referred to 
as hsFATPl, hsFATP2, hsFATP3, hsFATP4, hsFATPS and hsFATP6, have been 
5 identified. All four novel murine FATPs (mmFATP2-5) and a previously identified 
murine FATP (renamed herein FATP1) have orthologs in humans (hsFATPl-5); the 
sixth human FATP (hsFATP6) does not as yet have a mouse ortholog. The expression 
patterns of these FATPs vary, as described in detail below. . 

The present invention relates to FATP family members from prokaryotes and 

1 0 eukaryotes, nucleic acids (DNA, RNA) encoding FATPs, and nucleic acids which are 
useful as probes or primers (e.g., for use in hybridization methods, amplification 
methods) for example, in methods of detecting FATP-encoding genes, producing 
FATPs, and purifying or isolating FATP-encoding DNA or RNA. Also the subject of 
this invention are antibodies (polyclonal or monoclonal) which bind an FATP or 

15 FATPs; methods of identifying additional FATP family members (for example, 

orthologs of those FATPs described herein by amino acid sequence) and variant alleles 
of known FATP genes; methods of identifying compounds which bind to an FATP, or 
modulate or alter (enhance or inhibit) FATP function; compounds which modulate or 
alter FATP function; methods of modulating or altering (enhancing or inhibiting) FATP 

2 0 function and, thus, LCFA uptake into tissues of a mammal (e.g. human) by 

administering a compound or molecule (a drug or agent) which increases or reduces 
FATP activity; and methods of targeting compounds to tissues by administering a 
complex of the compound to be targeted to tissues and a component which is bound by 
an FATP present on cells of the tissues to which the compound is to be targeted. For 

2 5 example, a complex of a drag to be delivered to the liver and a component which is 
bound by an FATP present on liver cells (e.g., FATPS) can be administered. 

In one embodiment, the present invention relates to modulating or altering 
(enhancing or inhibiting/reducing) LCFA uptake in the small intestine and, thus, 
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increasing or reducing the number of calories in the form of fats available to an 
individual. In another embodiment, the present invention relates to inhibiting or 
reducing LCFA uptake in the small intestine in order to reduce circulating fatty acid 
levels; that is, LCFA uptake in the small intestine is reduced and, therefore, circulating 
5 (blood) levels are not as high as they otherwise would be. FATP4 has been shown to be 
expressed in epithelial cells of the small intestine and particularly in the brush border 
layer of the small intestine. FATP2 has also been shown to be expressed at low levels 
in epithelial cells of the small intestine, particularly in the duodenum. In contrast, 
FATP1 , FATP3, FATP5 and FATP6 were not detected in any of the intestinal tissues. 

1 0 Thus, also described herein are FATPs which are present in the epithelial cell layer of 
the small intestine where they mediate LCFA uptake. These FATPs, particularly 
FATP4 and also FATP2, are targets for methods and drugs which block their function 
or activity and are useful in treating obesity, diabetes and heart disease. The ability of 
these FATPs to mediate fat uptake can be modulated or altered (enhanced or inhibited), 

15 thus modulating fat uptake in the small intestine. This can be done, for example, by 
administering to an individual, such as a human or other animal, a drug which blocks 
interaction of LCFAs with FATP4 and/or FATP2 in the small intestine, thus inhibiting 
LCFA passage into the cells of the small intestine. As a result, fat absorption is reduced 
and, although the individual has consumed a certain quantity of fat, the LCFAs are not 

2 0 absorbed to the same extent they would have been in the absence of the compound 
administered. 

Thus, one embodiment of this invention is a method of reducing LCFA uptake 
(absorption) in the small intestine and, as a result, reducing caloric uptake in the form of 
fat. A further embodiment is a compound (drug) useful in inhibiting or reducing fat 
2 5 absorption in the small intestine, hi another embodiment, the invention is a method of 
reducing circulating fatty acid levels by administering to an individual a compound 
which blocks interactions of LCFAs withFATP4 and/or FATP2 in the small intestine, 
thus inhibiting LCFA passage into cells of the small intestine. As a result, fatty acids 
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pass into the circulatory system at a diminished level and/or rate, and circulating fatty 
acid levels are lower than they would be in the absence of the compound administered. 
This method is particularly useful for therapy in individuals who are at risk for or have 
hyperlipidemia. That is, it can be used to prevent the occurrence of elevated levels of 
5 lipids in the blood or to treat an individual in whom blood lipid levels are elevated. 
Also the subject of this invention is a method of identifying compounds which alter 
, FATP function (and thus, in the case of FATP2 and/or FATP4, alter LCFA uptake in 
the small intestine). 

In another embodiment, the present invention relates to a method of modulating 

1 0 or altering (enhancing or inhibiting) the function of FATP6, which is expressed at high 
levels in the heart. A method of inhibiting FATP6 function is useful, for example, in 
individuals with heart disease, such as ischemia, since reducing LCFA uptake into heart 
muscle in an individual who has ischemic heart disease, which may be manifested by, 
for example, angina or heait attack, can reduce symptoms or reduce the extent of 

1 5 damage caused by the ischemia. In this embodiment, a drug which inhibits FATP6 
function is administered to an individual who has had or is having a heart attack, to 
reduce LCFA uptake by the individual's heart and, as a result, reduce the damage caused 
by ischemia. In a further embodiment, this invention is a method of targeting a 
compound, such as a therapeutic drug or an imaging reagent, to heart tissue by 

2 0 administering to an individual (e.g., a human) a complex of the compound and a 

component (e.g., a LCFA or LCFA-like compound) which is bound by an FATP (e.g., 
FATP 6) present in cells of heart tissue. 

In a further embodiment, LCFA uptake by the liver is modulated or altered 
(enhanced or reduced), in an individual. For example,,, a drug which inhibits the 

2 5 function of an FATP present in liver (e.g., FATP5) is administered to an individual who 
is diabetic, in order to reduce LCFA uptake by liver cells and, thus reduce insulin 
resistance. 
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The present invention, thus, provides methods which are useful to alter, 
particularly reduce, LCFA uptake in individuals and, as a result, to alter (particularly 
reduce), availability of the LCFAs for further metabolism. In a specific embodiment, 
the present invention provides methods useful to reduce LCFA uptake and, thus, fatty 
5 . acid metabolism in individuals, with the result that caloric availability from fats is 

reduced, and circulating fatty acid levels are lower than they otherwise would be. These 
methods are useful, for example, as a means of weight control in individuals, (e.g., 
humans) and as a means of preventing elevated serum lipid levels or reducing serum 
lipid levels in humans. FATPs expressed in the small intestine, such as FATP4, are 

1 0 useful targets to be blocked in treating obesity (e.g., chronic obesity) or to be enhanced 
in treating conditions in which enhanced LCFA uptake is desired (e.g., malabsorption 
syndrome or other wasting conditions). 

The identification of this evolutionarily conserved fatty acid transporter family 
will allow a better understanding of the mechanisms whereby LCFAs traverse the lipid 

1 5 bilayer as well as yield insight into the control of energy homeostasis and its 
dysregulation in diseases such as diabetes and obesity. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the amino acid sequence alignment of FATPs: mmFATPl (SEQ 
ID NO:92), mmFATP2 (SEQ ID NO:93), mmFATP3 (SEQ ID NO:94), mmFATP4 

2 0 (SEQ ID NO:95), mmFATP5 (SEQ ID NO:96), ceFATPa (SEQ ID NO:97), scFATP 
(SEQ ID NO:98) and mtFATP (SEQ ID NO:99). The underlining (amino acid residues 
204-212 of mtFATP) indicates an AMP binding motif which is found in many classes 
of proteins; the underlining at amino acid residues 204-507 of the mtFATP sequence 
indicates the FATP 360 amino acid signature sequence. 

2 5 Figures 2 A-2D show results of LCFA uptake assays. Figures 2A-2D: COS 

cells were cotransfected using the DEAE-dextran method with the mammalian 
expression vectors pCDNA-CD2 either alone (control; Figure 2A) or in combination 
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with one of the F ATP -containing expression vectors (pCDNA-mmFATPl, Figure 2B; 
pCDNA-mmFATP2, Figure 2C; or pCMV-SPORT2-mmFATP5, Figure 2D) as 
described in Materials and Methods for Example 2. COS cells were gated on forward 
scatter (FSC) and side scatter (SS), and the results shown represent >1 0,000 cells. Cells 
5 exhibiting >300 CD2 fluorescence units (vertical line) representing 15% of all cells 
were deemed CD2 positive. 

Figure 3 is a graph of fluorescence of cells expressing a FATP gene. As in 
Figures 2A-2D, COS cells were cotransfected withpCDNA-CD2 either alone (control) 
or in combination with one of the FATP-containing expression vectors (pCDNA- 

1 0 mmFATP 1 , pCDNA-mmFATP2, pCMV-SPORT2-mmFATP5, or pCDN A-ceF ATPb) . 
The mean BODIPY-FA fluorescence of the CD2-positive cells is plotted; results shown 
represent tire average of three experiments, each consisting of greater than 50,000 COS 
cells. Note that a logarithmic scale is used on the ordinate. 

Figure 4 is a graph of the uptake of palmitate with time. The full-length coding 

15 region of mtFATP (squares) or a control protein (TFE3; circles) was subcloned into the 
inducible, prokaryotic expression vector pET (Novagen). Expression from the resulting 
plasmid was induced (solid symbols) in transformed E. coli cells with 1 mM isopropyl- 
p-D-thiogalactoside (IPTG) for 1 hour, or cells were left uninduced (open symbols). 
Data points were done in triplicate and counts were normalized to the number of 

2 0 bacteria as determined by OD 600 . 

Figure 5 is a phylogenetic tree produced by aligning complete and partial 
sequences for FA TP genes from human, rat, mouse, puffer fish, D. melanogaster, C 
elegans, S. cerevisiae, and M. tuberculosis using ClustalX and using these data to 
produce a phylogenetic tree using TreeViewPPC. The bar indicates the number of 

25 substitutions per residue, i.e., 0.1 corresponds to a distance of 10 substitutions per 100 
residues. 
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Figure 6 shows a comparison of the FATP signature sequences of mrnFATP 1 . 
(SEQ ID NO:l), mmFATPS, (SEQ ID NO:2), ceFATPa (SEQ ID NO:3), scFATP (SEQ 
ID NO:4) and mtFATP (SEQ ID NO:5). 

Figure 7 shows the sequence identity among the FATP family members and 
5 VLACs, based on the 360 amino acid signature sequence of FATP from Figure 1 . 

Figures 8A and 8B axe the mmFATP3 DNA sequence (SEQ ID NO:6). 

Figure 9 is the mmFATP3 protein sequence (SEQ ID NO:7). 

Figures 10A and 10B are the mmFATP4 DNA sequence (SEQ ID NO:8). 

Figure 1 1 is the mmFATP4 protein sequence (SEQ ID NO:9). 
. 0 Figures 1 2A and 1 2B are the mmFATPS DNA sequence (SEQ ID NO: 1 0). 

Figure 13 is the mmFATPS protein sequence (SEQ ID NO:l 1). 

Figures 14A and 14B are the hsFATP2 DNA sequence (SEQ ID NO: 12). 

Figure 15 is the hsFATP2 protein sequence (SEQ ID NO: 13). 

Figures 16A and 16B are the hsFATP3 DNA sequence (SEQ ID NO: 14). 
5 Figure 17 is the hsFATP3 protein sequence (SEQ ID NO:15). 

Figures 18 A and 18B are the hsFATP4 DNA sequence (SEQ ID NO: 16). 

Figure 19 is the hsFATP4 protein sequence (SEQ ID NO: 17). 

Figures 20A and 20B are the hsFATPS DNA sequence (SEQ ID NO: 18). 

Figure 21 is the hsFATPS protein sequence (SEQ ID NO: 19), 
0 Figures 22A and 22B are the hsFATP6 DNA sequence (SEQ ID NO:20). 

Figure 23 is the hsFATP6 protein sequence (SEQ ID NO:21). 

Figures 24A and 24B are the mtFATP DNA sequence (SEQ ID NO:22). 

Figure 25 is the mtFATP protein sequence (SEQ ID NO: 23). 

Figure 26 shows the DNA sequence (SEQ ID NO:24) and predicted amino' acid 
5 sequence (SEQ ID NO:25) of human FATP1 . 

Figure 27 shows the DNA sequence (SEQ ID NO:26) and predicted amino acid 
sequence (SEQ ID NO:27) of human FATP4. 
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Figure 28A.is a hydrophobicity plot for hsFATPl, showing that it has multiple 
membrane-spanning domains. 

Figure 28B is the amino acid composition of hsFATPl. 

Figure 28C is a hydrophilicity plot for hsFATPl, made using the Kyte-Doolittle 
5 method, averaging hydrophilicity values for 18 amino acid residues at a time. 

Figure 29A is a hydrophobicity plot for hsFATP4, showing that it has multiple 
membrane-spanning domains. 

Figure 29B is a listing of the amino acid composition of hsFATP4. 

Figure 29C is a hydrophilicity plot for hsFATP4, made using the Kyte-Doolittle 
1 0 method, averaging hydrophilicity values for 1 8 amino acid residues at a time. 

Figures 30A and 30B show a comparison of the nucleotide sequence of human 
FATP1 (SEQ ID NO:28) and the nucleotide sequence of mouse FATP1 (SEQ ID 
NO:29) v 

Figures 3 1 A and 3 IB show a comparison of the nucleotide sequence of human 
15 FATP4 (SEQ ID NO:30) and the nucleotide sequence of mouse FATP4 (SEQ ID 
NO;31). 

Figure 32 shows a comparison of the amino acid sequence of human FATP1 
(SEQ ID NO: 3 2) and the amino acid sequence of mouse FATP1 (SEQ ID NO:33). 
Shaded amino acid residues match the concensus sequence exactly 
2 0 Figure 33 shows a comparison at the amino acid level of human FATP4 (SEQ 

ID NO:34) and mouse FATP4 (SEQ ID NO:35). Shaded amino acid residues match the 
concensus sequence exactly. 

Figure 34 shows the nucleotide sequence (SEQ ID NO:36) and predicted amino 
acid sequence (SEQ ID NO:37) of hsFATP6. 
2 5 Figure 35 A is a hydrophobicity plot for hsFATP6, showing that it has multiple 

membrane-spanning domains. 

Figure 35B is a listing of the amino acid composition of hsFATP6. 
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Figure 35C is a hydrophilicity plot for hsFATP6, made using the Kyte-Doolittle 
method, averaging hydrophilicity values for 18 amino acid residues at a time. 

Figure 36 shows an alignment of the amino acid sequences of hsFATPl (SEQ 
ID NO:38), hsFATP4 (SEQ ID NO:39) and hsFATP6 (SEQ ID NO:40). Shaded amino 
5 acid residues match the concensus sequence exactly. 

Figure 37 shows results of assessment of fatty acid uptake by human FATP 1 and 
human FATP4. The percent of CD2-positive cells exhibiting a BODIPY-fLuorescence 
of more than 300 arbitrary units is plotted for the three different conditions tested. 

Figure 38 is a graph showing uptake of tritiated oleate, with time, by 293 cells 
1 0 transfected with either (diamonds) a plasmid for expression of human FATP4 or 
(squares) a control plasmid. 

Figure 39 is an illustration of the amino acid sequences of human F ATP4 (SEQ 
ID NO:41) and mouse FATP4 (SEQ ID NO:42) compared to human FATP1 (SEQ ID 
NO:43). Shown by underlining iare the FATP consensus sequence (236-556 of 
15 hsFATPl) and the AMP-binding motif (246-254 of hsFATPl). The human FATPs 
were cloned by screening libraries with sequences from ESTs (expressed sequence 
tags). Mouse FATP4 was cloned by PGR using degenerate primers. 

Figure 40 is a graph showing the uptake, with time, of tritiated oleate by mouse 
enterocytes in the presence of no oligonucleotide (squares), sense oligonucleotide 
2 0 (circles) or antisense oligonucleotide (diamonds). 

Figure 41 is a bar graph showing uptake of tritiated oleate, by mouse enterocytes 
in the presence of various concentrations of antisense (solid bars), mismatch (stippled 
bars) or sense (lined bars) oligonucleotides. 

Figure 42 is a bar graph showing uptake of tritiated oleate and uptake of 35 S- 
2 5 labeled methionine by mouse enterocytes to which were added no oligonucleotide, the 
antisense oligonucleotide, or the mismatch oligonucleotide. 

Figure 43 A is the nucleotide sequence of the gene encoding mouse FATP4 (SEQ 
IDNO:44). 
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Figure 43B is the amino acid sequence of mouse FATP4 protein (SEQ ID 
NO:45). 

Figures 44A, 44B, and 44C are the hsFATPl DNA sequence (SEQ ID NO: 46). 
Coding region: 175-2115 (1941 nt). 
5 Figure 45 is the hsFATP 1 protein sequence (SEQ ID NO:47). 

Figures 46A and 46B are the hsFATP2 DNA sequence (SEQ ID NO:48). 
Coding region: 223-2085 (1863 nt). 

Figure 47 is the hsFATP2 protein sequence (SEQ ID NO:49). 
Figure 48 is the partial DNA sequence of hsFATP3 (SEQ ID NO:50). Coding 
10 region: 1.-993. 

Figure 49 is the partial protein sequence of hsFATP3 (SEQ ID NO: 51). 
Figures 50A, 50B, and 50C are the hsFATP4 DNA sequence (SEQ ID NO:52). 
Coding region: 208-2139 (1932 nt). 

Figure 5 1 is the hsFATP4 protein sequence (SEQ ID NO:53). 
1 5 Figure 52 is the hsFATP5 partial DNA sequence (SEQ ID NO:54). Coding 

region: 1-1062. 

Figure 53 is the hsFATP5 partial protein sequence (SEQ ID NO:55). 
Figures 54A, 54B, and 54C are the hsFATP6 DNA sequence (SEQ ID NO:56). 
Coding region: 643-2502 (1860 nt). 
2 0 Figure 55 is the hsFATP6 protein sequence (SEQ ID NO:57). 

Figures 56A, 56B, and 56C are the rnFATPl DNA sequence {xn=Rattus 
norvegicus; (SEQ ID NO:58). Coding region: 75-2015 (1941 nt). 

Figure 57 is the mFATP 1 protein sequence (SEQ ID NO:59). 
Figure 58A, 58B, and 58C are the ruFATP2 DNA sequence (SEQ ID NO:60). 
2 5 Coding region: 795-2657 (1 863 nt). 

Figure 59 is the rnFATP2 protein sequence (SEQ ID NO:61). 
Figure 60A and 60B are the mFATP4 partial DNA sequence (SEQ ID NO: 62). 
Coding region: 1-1218. 
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Figure 6 1 is the mFATP4 partial DNA sequence (SEQ ID NO:63). 

Figure 62A, 62B, and 62C are the mmFATPl DNA sequence (SEQ ID NO:64). 
Coding region: 1-1944. 

Figure 63 is the mmFATPl protein sequence (SEQ ID NO:65). 
5 Figures 64A and 64B are the mmFATP2 DNA sequence (SEQ ID NO:66). 

Coding region: 121-1992 (1872 nt). 

Figure 65 is the mmFATP2 protein sequence (SEQ ID NO:67). 

Figures 66A and 66B are the mmFATP3 partial DNA sequence (SEQ ID 
NO:68). Coding region: 1-1830. 

1 o Figure 67 is the mmFATP3 partial protein sequence (SEQ ID NO:69). 

Figures 68A, 68B, and 68C are the mmFATP4 DNA sequence (SEQ ID NO:70). 
Coding region: 1-1932. 

Figures 69 is the mmFATP4 protein sequence (SEQ ID NO:71). 

Figures 70A and 70B are the mmFATP5 DNA sequence (SEQ ID NO:72). 
15 Coding region: 60-2129. 

Figure 71 is the mmFATP5 protein sequence (SEQ ID NO:73). 

Figures 72A and 72B are the dmFATP partial DNA sequence (dm~Drosophila 
melanogaster; SEQ ID NO:74). Coding region: 1-1773. 

Figures 73 is the dmFATP partial protein sequence (SEQ ID NO:75). 

2 o Figure 74 is the drFATP partial DNA sequence (dx-Danio rerio, zebrafish; SEQ 

ID NO-.76) Coding region: 1-173. 

Figure 75 is the drFATP partial protein sequence (SEQ ID NO:77). 

Figure 76A and 76B are the ceFATPa DNA sequence (SEQ ID NO:78). Coding 
region: 1-1953. 

2 5 Figure 77 is the ceFATPa protein sequence (SEQ ID NO:79). 

Figures 78A and 78B are the ceFATPb DNA sequence (SEQ ID NO:80). 
Coding region: 1-1968. 

Figure 79 is the ceFATPb protein sequence (SEQ ID NO:81). 
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Figures 80A and 80B are the chFATP DNA sequence (SEQ ID NO:82; 
chr=Cochliobolu heterostrophus). Coding region: 1-1932. 

Figure 81 is the chFATP protein sequence (SEQ ID NO:83). 

Figure 82 is the anFATP partial protein sequence (an=Aspergillus nidulans; 
5 SEQIDNO:84). Coding region: 1-597. 

Figure 83 is the anFATP partial protein sequence (SEQ ID NO: 85). 

Figure 84 is the ingFATP partial DNA sequence (mg= Magnaporthe grisea, rice 
blast; SEQ ID NO: 86). Coding region: 1-522. 

Figure 85 is the mgFATP partial protein sequence (SEQ ID NO:87). 
10 Figures 86A and 86B are the scFATP DNA sequence (SEQ ID NO:88). Coding 

region: 1-1872. 

Figure 87 is the scFATP protein sequence (SEQ ID NO:S9). 

Figures 88 A and 88B are the mtFATP DNA sequence (SEQ ID NO:90). 

Figure 89 is the mtFATP protein sequence (SEQ ID NO:91). Coding region: 1- 

15 1794 

Figure 90 is a concensus sequence of the FATP signature sequence (SEQ ID 

NO: 

1 00), based on 23 independent sequences aligned in ClustalX. The height of the bar at 
each amino acid residue position indicates the degree of conservation at that position. 
2 0 Gaps have been inserted to maintain the strength of the alignment. 

Figure 91 is a hydrophilicity plot for hsFATP2, made using the Kyte-Doolittle 
method, averaging hydrophilicity values for 1 8 amino acid residues at a time. 

Figure 92 is a hydrophilicity plot for the hsFATP3 partial protein, made using 
the Kyte-Doolittle method, averaging hydrophilicity values for 18 amino acid residues 
2 5 at a time. 

Figure 93 is a hydrophilicity plot for the hsFATPS partial protein, made using 
the Kyte-Doolittle method, averaging hydrophilicity values for 18 amino acid residues 
at a time. 
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Figures 9*4 A and 94B are a representation of the DNA sequence (SEQ ID 
NO: 101) of the hsFATP5 gene, and the amino acid sequence (SEQ ID NO: 102) of the 
hsFATPS protein. 

DETAILED DESCRIPTION OF THE INVENTION 
5 As described herein, FATPs are a large evolutionarily conserved family of 

proteins that mediate the transport of LCFAs into cells. The family includes proteins 
which axe conserved from mycobacteria to humans and exhibit very different expression 
patterns in tissues. Specific embodiments described include FATPs from mice, humans, 
nematodes, fungi and mycobacteria which have been shown to be functional LCFA 

1 0 transporters. The term "fatty acid transport proteins" ("FATPs") as used herein, refers 
to the proteins described herein as FATP1, FATP2, FATP 3, FATP4, FATPS and 
FATP6, which have been described in one or more species of mammals, as well as 
mtFATP, ceFATP, scFATP, anFATP, mgFATP, and chFATP, and other proteins 
sharing at least about 50% amino acid sequence similarity, preferably at least about 60% 

1 5 sequence similarity, more preferably at least about 70% sequence similarity, and still 
more preferably, at least about 80% sequence similarity, and most preferably, at least 
about 90% sequence similarity in the approximately 360 amino acid signature sequence. 
The approximaely 360 amino acid FATP signature sequence is shown in Figure 1. The 
concensus sequence of the signature sequence is shown in Figure 90. The nomenclature 

2 0 used herein to refer to FATPs. includes a species-specific prefix (e.g., mm, Mus 

musculus; hs or h, Homo sapiens or human; mt M tuberculosis; dm. D. melanogaster\ 
ce, C. elegans\ sc, Saccharomyces cereyisiae) and a number such that mammalian 
homologues in different species share the same number. For example, six human and 
five mouse FATP genes which are expressed in a variety of tissues are described herein 

2 5 and are referred to, respectively, as hsF ATP 1 -hsFATP6 and mmF ATP 1 -mmF ATP 5 ; for 
example, hsFATP4 and mmFATP4 are the human and mouse orthologs. 
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Expression patterns of human and mouse FATPs have been assessed and are 
described below. Briefly, results of these assessments show that FATP5 is a liver- 
specific gene. FATP2 is highly expressed in liver and kidney. Both of these proteins, 
as well as FATP4 and FATPs from nematodes and mycobacteria, have been shown to 
5 be functional LCFA transporters. Results have also shown that FATP4 mRNA is 
present at high levels in epithelial cells of two regions of the small intestine (the 
jejunum and ileum) and at lower, but significant, levels in a third region (the 
duodenum). They further showed that FATP2 mRNA is present in epithelial cells of the 
duodenum at a level similar to that of FATP4 mRNA levels, but is present at lower 

1 0 levels in the jejunum and ileum. FATP4 mRNA was absent from other cell types of the 
small intestine and no FATP4 mRNA could be detected in any cells of the colon. No 
signals above background could be detected for FATP1, FATP3 and FATPS in any of 
die intestinal tissues. Thus, FATP4 is the major FATP in the mouse small intestine, 
which supports a major role for FATP4 (along with FATP2 to a lesser extent) in 

15 absorption of free fatty acids. hsFATP4 was clearly expressed in the jejunum and 

ileum; expression was absent in the stomach. This, too, is- consistent with a major role 
for FATP4 in absorption of fatty acids in the human gut. Analysis of FATP expression 
in human tissues, also described in detail below, showed that hsFATP6, which has no 
mouse ortholog as yet, is expressed at high levels in the heart and at low levels in the 

2 0 placenta, but is undetectable in the other tissues assessed (Example 9). This is 
consistent with a major role for FATP6 in absorption of fatty acids in the heart. 

Long chain fatty acids (LCFAs) are an important' energy source for pro- and 
eukaryotes and are involved in diverse cellular processes, such as membrane synthesis, 
intracellular signaling, protein modification and transcriptional regulation. In 

2 5 developed Western countries, human dietary lipids are mainly di- and triglycerides and 
account for approximately 40% of caloric intake (Weisburger, J. H. (1997) J. Am. Diet 
Assoc. 07:S16-S23). These lipids are broken down into fatty acids and glycerol by 
pancreatic lipases in the small intestine (Chapus, C, Rovery, M., Sarda, L & Verger, R. 



WO 99/36537 



PCT/US99/00182 



-16- 

(1988) Biochimie 70:1223-34); LCFAs are then transported into brush border cells, 
where the majority is re-esterified and secreted into the lymphatic system as 
chylomicrons (Green, P.H. & Riley, J.W. (1981) Aust. M.Z.J. Med. 77:84-90). Fatty 
acids axe liberated from lipoproteins by the enzyme lipoprotein lipase, which is bound to 
5 the luminal side of endothelial cells (Scow, R.O. & Blachette-Mackie, EJ. (1992) Mol 
Cell Biochem 77(5:181-191). "Free" fatty acids in the circulation are bound to serum 
albumin (Spector, A.A. (1984) Clin. Physiol Biochem 2:123-134) and are rapidly 
incorporated by adipocytes, hepatooytes, and cardiac muscle cells. The latter derive 60- 
90% of their energy through the oxidation of LCFAs (Neely, IF. Rovetto, MI & 
10 Oram, J.F. (1972) Prog. Cardiovasc, Dis: 75:289-329). Although saturable and specific 
uptake of LCFAs has been demonstrated for intestinal cells, hepatocytes, cardiac 
myocytes, and adipocytes, the molecular mechanisms of LCFA transport across the 
plasma membrane have remained controversial (Hui, T.Y. & Bemlohr, D.A. (1997) 
Front. BioscL 7J:d222-31-d231; Schaffer, IE. & Lodish, H.F, (1995) Trends 
15 Cardiovasc. Med. 5:218-224). Described herein is a large family of highly homologous 
- mammalian LCFA transporters which show wide expression, including in all tissues 
relevant to fatty acid metabolism. Further described are novel members of this family 
in other species, including mycobacterial and nematode FATPs which, like their 
mammalian counterparts, are functional fatty acid transporters. 
2 0 The discovery of a diverse but highly homologous family of FATPs is 

reminiscent of the glucose transporter family. In a manner similar to the FATPs, the 
glucose transporters have very divergent patterns of tissue expression (McGowan, K.M., 
Long, S.D. & Pelcala, P.H. {1995) Pharmacol Then 65:465-505). The FATPs, like 
glucose transporters, may also differ in their substrate specificities, uptake kinetics, and 
25 ■ hormonal regulation (Thorens, B. (1996) Am. J. Physiol 270:G541-G553). Indeed, the 
levels of fatty acids in the blood, like those of glucose, can be regulated by insulin and 
are dysregulated hi diseases such as noninsulin-dependent diabetes and obesity (Boden, 
G. (1997) Diabetes 46:3-10): The underlying mechanisms for the regulation of free 
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fatty acid concentrations in the blood are not understood, but could be explained by . 
hormonal modulation of FATPs. 

Insulin-resistance is thought to be the major defect in non insulin-dependent 
diabetes mellitus (NIDDM) and is one of the earliest manifestations of NIDDM 
5 (McGarry (1992) Science 258:766-770). Free fatty acids (FFAs) may provide an 
explanation for why obesity is a risk factor for NIDDM. Plasma levels of FFAs are 
elevated in diabetic patients (Reaven et al (1988) Diabetes 37:1020). Elevated plasma 
free fatty acids (FFAs) have been demonstrated to induce insulin-resistance in whole 
animals and humans (Boden (1998) Front Biosci 3:D169-D175). This insulin- 
10 ' resistance is likely mediated by effects of FFAs on a variety of issues. FFAs added to 
adipocytes in vitro induce insulin resistance in this cell type as evidenced by inhibition 
of insulin-induced glucose transport (Van Epps-Fung et al (1997) Endocrinology 
138:4338-4345). Rats fed a high fat diet developed skeletal muscle insulin resistance as 
evidenced by a decrease in insulin-induced glucose uptake by skeletal muscle (Han et 
15 al, (1 997) Diabetes 46: 1761 -1767). In addition, elevated plasma FFAs increase 

insulin-suppressed endogenous glucose production in the liver (Boden (1998) Front 
Biosci. 3:D169-D175), thus increasing hepatic glucose output. It has been postulated 
that the adverse effects of plasma free fatty acids are due to the FFAs being taken up 
into the cell, leading to an increase in intracellular long chain fatty acyl Co A; 
2 0 intracellular long chain acyl CoAs are thought to mediate the effects of FFAs inside the 
cell. Thus, fatty acid induced insulin-resistance may be prevented by blocking uptake 
of FFAs into select tissues, in particular liver (by blocking FATP2 and/or FATPS), 
adipocyte (by blocking FATP1), and skeletal muscle (by blocking FATP1). Blocking 
intestinal fat absorption (by blocking FATP4) is also expected to reduce plasma FFA 
2 5 levels and thus improve insulin resistance. 

During the pathogenesis of NIDDM insulin-resistance can initially be 
counteracted by increasing insulin output by the pancreatic beta cell. Ultimately, this 
compensation fails, beta cell function decreases and overt diabetes results (McGarry 
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(1 992) Science 258: 766-770). Manipulating beta cell function is a second point where 
fatty acid transporter blockers may be beneficial for diabetes. While no FATP homolog 
has been identified so far that is expressed in the beta cell of the pancreas, the data 
described below suggest the existence of such a transporter and the sequence 
5 information included herein provides the means to identify such a transporter by 

degenerate PCR, using primers to regions conserved in all FATP family members or by 
low stringency hybridization, It has been demonstrated that exposure of pancreatic 
beta-cells to FFAs increases the basal rate of insulin secretion; this in turn leads to a 
decrease in the intracellular stores of insulin, resulting in decreased capacity for insulin 

10 secretion after chronic exposure (Bollheimer et aL, (1998) J. Clin. Invest 101 :1094- 
1.101). The effects of FFAs are again likely to be mediated by intracellular long chain 
fatty acyl CoA molecules (Liu et al, (1998)/. Clin. Invest 101:1870-1875). FFAs 
have also been demonstrated to increase beta cell apoptosis (Shimabulcuro et at, (1998) 
Proc. Nat Acad. Set USA 95:2498-2502), possibly contributing to the decrease in beta 

1 5 cell numbers in late stage NIDDM 

Another finding with potentially broad implications is the identification of a 
FATP homologue inM tuberculosis. Tuberculosis causes more deaths worldwide than 
any other infectious agent and drug-resistant tuberculosis is re-emerging as a problem in 
industrialized nations (Bloom, B.R. & Small, P.M. (1998) N. Engl J. Pled 338:611- 

2 0 678). Mycobacterium tuberculosis has about 250 enzymes involved in fatty acid 

metabolism, compared with only about 50 inE. colt It has been suggested that, living 
as a pathogen, the mycobacteria are largely lipolytic, rather than lipogenic, relying on 
the lipds within mammalian cells and the tubercle (Cole, S.T. et al, Nature 393:531- 
544 (1998)). The de novo synthesis of fatty acids in Mycobacterium leprae is 

2 5 insufficient to maintain growth (Wheeler, P.R., Bulmer, K & Ratledge, C. (1990) X 
Gene. Microbiol 136:21 1-217). Thus, it is reasonable to expect that inhibitors of 
mtFATP will serve as therapeutics for tuberculosis. FATPs expressed in mycobacteria 
can be targeted to reduce or prevent replication of mycobacteria (e.g., to reduce or 
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prevent replication of M tuberculosis) and, thus, reduce or prevent their adverse effects. 
For example, a FATP or FATPs expressed by M tuberculosis can be targeted and 
inhibited, thus reducing or preventing growth of this pathogen (and tuberculosis in 
humans and other mammals). An inhibitor of ai\M. tuberculosis FATP can be 
5 identified, using methods described herein (e.g., expressing the FATP in an appropriate 
host cell, such as E. coli or COS cells; contacting the cells with an agent or drug to be 
assessed for its ability to inhibit the FATP and, as a result, mycobacterial growth, and 
assessing its effects on growth). A drug or agent identified in this manner can be further 
tested for its ability to inhibit aM tuberculosis FATP and M tuberculosis infection in 

10 an appropriate animal model or in humans. A method of inhibiting mycobacterial 
growth, particularly growth of M tuberculosis, and compounds useful as drugs for 
doing so are also the subject of this invention. 

An isolated polynucleotide encoding mtFATP, like other polynucleotides 
encoding FATPs of the FATP family, can be incorporated into vectors, nucleic acids of 

15 viruses, and other nucleic acid constructs that can be used in various types of host cells 
to produce mtFATP. This mtFATP can be used, as it appears on the surface of cells, or 
in various artificial membrane systems, to assess fatty acid transport function, to 
identify ligands and molecules that are modulators of fatty acid transport activity. 
Molecules found to be inhibitors of mtFATP function can be incorporated into 

2 0 pharmaceutical compositions to administer to a human for the treatment of tuberculosis. 

Particular embodiments of the invention are polynucleotides encoding a FATP 
of Cochliobolus (Helminthosporium) heterostrophus or portions or variants thereof, the 
isolated or recombinantly produced FATP, methods for assessing whether an agent 
binds to the chFATP, and further methods for assessing the effect of an agent being 

2 5 tested for its ability to modulate fatty acid transport activity. Cochliobolus 

heterostrophus is an ascomycete that is the cause of southern com leaf blight, an 
economically important threat to the com crop in the United States. The related species 
C sativus causes crown rot and common root rot in wheat and barley. One or more 
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FATPs of C. heterosirophus can be targeted for the identification of an inhibitor of 
chFATP function, which can be then be used as an agent effective against infection of 
plants by C. heterostrophus and related organisms. Methods described herein that were 
applied in studying the expression of a FATP gene and the function of the FATP in its 
5 natural site of expression or in a host cell, can be used in the study of the chFATP gene 
and protein. 

Magnaporthe grisea (rice blast) is an economically important fungal pathogen of 
rice. Further embodiments of the invention are nucleic acid molecules encoding a 
FATP of Magnaporthe gi-isea, portions thereof, or variants thereof, isolated mgFATP, 

1 o nucleic acid constructs, and engineered cells expressing mgFATP. Other aspects of the 

invention are assays to identify an agent which binds to mgFATP and assays to identify 
an agent which modulates the function of mgFATP in cells in which mgFATP is 
expressed or in artificial membrane systems. Agents identified as inhibiting mgFATP 
activity can be developed into anti-fungal agents to be used to treat rice infected with 
15 rice blast. 

Caenorhabditis elegans is a nematode related to plant pathogens and human 
parasites. An isolated polynucleotide which encodes ceFATP, like other 
polynucleotides encoding FATPs of the FATP family described herein, can. be 
incorporated into nucleic acid vectors and other constructs that can be used in various 

2 0 types of cells to produce ceFATP. ceFATP as it occurs in cells or as it can be isolated 

or incorporated into various artificial or reconstructed membrane systems, can be used 
to assess fatty acid transport, and to identify ligands and agents that modulate fatty acid 
transport activity. Agents found by such assays to be inhibitors of ceFATP activity can 
be incorporated into compositions for the treatment of diseases caused by genetically 
2 5 related organisms with a FATP of similar sensitivity to the agents. 

Aspergillus nidulans is one of a family of fungal species that can infect humans. 
Further embodiments of the invention of the family of polynucleotides encoding FATPs 
are polynucleotides encoding a FATP of Aspergillus nidulans, and vectors and host 
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cells that can be constructed to comprise such polynucleotides. Further embodiments 
are a polypeptide encoded by such polynucleotides, portions thereof having one or more 
functions characteristic of a FATP, and various methods. The methods include.those 
for identifying agents that bind to anFATP and those for assessing the effect of an agent 
5 being tested for its ability to modulate fatty acid transport activity. Those agents found 
to inhibit fatty acid transport function can be used in compositions as anti-fungal 
pharmaceuticals, or can be modified for greater effectiveness as a pharmaceutical. 

One aspect of the invention relates to isolated nucleic acids that encode a FATP 
as described herein, such as those FATPs having an amino acid sequence in Figure 45 

1 0 (SEQ ID NO:47)„Figure 47 (SEQ ID NO:49), Figure 49 (SEQ ID NO:51), Figure 51 
(SEQ ID NO:53), Figures 94A and 94B (SEQ ID NO: 102), and Figure 55 (SEQ ID 
NO: 57) and nucleic acids closely related thereto as described herein. 

Using the information provided herein, such as a nucleic acid sequence set forth 
in Figures 44A-44C (SEQ ID NO:46), Figures 46A and 46B (SEQ ID NO:48), Figure 

15 48 (SEQ ID NO:50), Figures 50A-50C (SEQ ID NO:52), Figures 94A and 94B (SEQ 
ID NO: 101), and Figures 54A-54C (SEQ ID NO:56), a nucleic acid of the invention 
encoding a FATP polypeptide may be obtained using standard cloning and screening 
methods, such as those for cloning and sequencing cDNA library fragments, followed 
by obtaining a full length clone. For example, to obtain a nucleic acid of the invention, 

2 0 a library of clones of cDNA of human or other mammalian DNA can be probed with a 
labeled oligonucleotide, such as a radiolabeled oligonucleotide, preferably about 17 
nucleotides or longer, derived from a partial sequence. Clones carrying DNA identical 
to that of the probe can then be distinguished using stringent (also, "high stringency") 
hybridization conditions. By sequencing the individual clones thus identified with 

2 5 sequencing primers designed from the original sequence it is then possible to extend the 
sequence in both directions to determine the full length sequence. Suitable techniques 
are described, for example, in Current Protocols in Molecular Biology (F.M. Ausubel et 
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al, eds), containing supplements through Supplement 42, 1998, John Wiley and Sons, 
Inc., especially chapters 5, 6 and 7. 

Embodiments of the invention include isolated nucleic acid molecules 
comprising any of the following nucleotide sequences: i.) a nucleotide sequence which 
5 encodes a protein comprising the amino acid sequence of hsFATPl (SEQ .ID NO:47), 
the amino acid sequence of hsFATP2 (SEQ ID NO:49), the amino acid sequence of 
hsFATP3 (SEQ ID NO:51) > the amino acid sequence of hsFATP4 (SEQ ID NO: 53), the 
amino acid sequence of hsFATPS (SEQ ID NO:102) or the amino acid sequence of 
hsFATP6 (SEQ ID NO:57); Z) nucleotide sequences of hsFATPl, hsFATP2, 

10 hsFATP3, hsFATP4, hsFATPS, or hsFATP6 (SEQ ID NO:46, 48, 50, 52, 1 01, or 56, 
respectively); 3.) a nucleotide sequence which is complementary to the nucleotide 
sequence of hsFATPl (SEQ ID NO:46), hsFATP2 (SEQ ID NO:48), hsFATP3 (SEQ 
ID NO:50), hsFATP4 (SEQ ID NO:52), hsFATP5 (SEQ ID NO: 101) or hsFATP6 (SEQ 
ED NO:56); 4.) a nucleotide sequence which consists of the coding region of hsFATPl 

15 (SEQ ID NO:46), the coding region of hsFATP2 (SEQ ID NO:48), the coding region of 
hsFATP3 (SEQ ID NO:50), the coding region of tisFATP4 (SEQ ID NO:52), the coding 
region of hsFATPS (SEQ ID NO:101), or the coding region of hsFATP6 (SEQ ID 
NO:56). 

The invention further relates to nucleic acids (nucleic acid molecules or 
2 0 polynucleotides) having nucleotide sequences identical over their entire length to those 
shown in the figures, for instance Figures 44A-44C (SEQ ID NO:46), Figures 46A and 
46B (SEQ ID NO:48), Figure 48 (SEQ ID NO:50), Figures 50A-50C (SEQ ID NO:52), 
Figures 94A and 94B (SEQ ID NO:101), and Figures 54A-54C (SEQ ID NO:56> It 
further relates to DNA, which due to the degeneracy of the genetic code, encodes a 
2 5 F ATP encoded by one of the FATP-encoding DNAs, whose amino acid sequence is 
provided herein. Also provided by the invention are nucleic acids having the coding 
sequences for the mature polypeptides or fragments in reading frame with other coding 
sequences, such as those encoding a leader or secretory sequence, a pre-, or pro- or 
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prepro- protein sequence. The nucleic acids of the invention encompass nucleic acids, 
that include .a single continuous region or discontinuous regions encoding the 
polypeptide, together with additional regions, that may also contain coding or non- 
coding sequences. The nucleic acids may also contain non-coding sequences, including, 
5 for example, but not limited to, non-coding 5' and 3' sequences, such as the transcribed, 
non-translated sequences, termination signals, ribosome binding sites, sequences that 
stabilize mRNA, introns, polyadenylation signals, and additional coding sequences 
which encode additional amino acids. For example, a marker sequence that facilitates 
purification of the fused polypeptide can be encoded. In certain embodiments of the 

1 0 invention, the marker sequence can be a hexa-histidine peptide, as provided in the pQE 
vector (Qiagen, Inc.) and described in Gentz et ai, Proc. Natl Acad. Set USA 86: 821- 
824 (1989), or an HA tag (Wilson et al, Cell 37: 767 (1984)), or a sequence encoding 
glutathione S-transferase of Schistosoma japonicum (vectors available from Pharmacia; 
see Smith, D.B. and Johnson K.S., Gene 57:31 (1988) andlCaelin, W.G. et al, Cell 

15 70:351 (1992)). Nucleic acids of the invention also include, but are not limited to, 
nucleic acids comprising a structural gene and its naturally associated sequences that 
control gene expression. 

The invention further relates to variants, including naturally-occuiring allelic 
variants, of those nucleic acids described specifically herein by DNA sequence, that 

2 0 encode variants of such polypeptides as those having the amino acid sequences shown 
in Figure 45 (SEQ ID NO:47), Figure 47 (SEQ ID NO:49), Figure 49 (SEQ ID NO:51), 
Figure 51 (SEQ ID NO:53) Figures 94A and 94B (SEQ ID NO: 102), or Figure 55 (SEQ 
ID NO:57), Such variants include nucleic acids encoding variants of the above-listed 
amino acid sequences, wherein those valiants have several, such as 5 to 10, 1 to 5, or 3, 

2 5 2 or 1 amino acids substituted, deleted, or added, in any combination. Variants include 
polynucleotides encoding polypeptides with at least 95% but less than 100% amino acid 
sequence identity to the polypeptides described herein by amino acid sequence. Variant 
polynucleotides hybridize, under low to high stringency conditions, to the alleles 



WO 99/36537 



PCT/US99/00182 



-24- 

described herein by DNA sequence. In one embodiment, variants have silent 
substitutions, additions and deletions that do not alter the properties and activities of the 
FATP Allelic variants of the polynucleotides encoding hsFATPl (Figure 45; SEQ ID 
NO:47), hsFATP2 (Figure 47; SEQ ID NO:49), hsFATP3 (Figure 49; SEQ ID NO:5 1), 
5 hsFATP4 (Figure 51; SEQ ID NO:53), Figures 94A and 94B (SEQ ID NO:102) and 
hsFATP6 (Figure 55; SEQ ID NO:57) will be identified as mapping to chromosomal 
locations listed for the corresponding wild type genes in Table 2 in Example 1 . 

Orthologous genes are gene loci in different species that are sufficiently similar 
to each other in their nucleotide sequences to suggest that they originated from a 
10 ' common ancestral gene. Orthologous genes arise when a lineage splits into two species, 
rather than when a gene is duplicated within a genome. Proteins that are orthologs are 
encoded by genes of two different species, wherein the genes are said to be orthologous. 

The invention further relates to polynucleotides encoding polypeptides which 
are orthologous to those polypeptides having a specific amino acid sequence described 
15 herein, such as the amino acid sequences shown in Figure 45 (SEQ ID NO:47), Figure 
47 (SEQ ID NO:49), Figure 49 (SEQ ID NO:51), Figure 51 (SEQ ID NO:53), Figures 
94A and 94B (SEQ ID NO: 102), or Figure 55 (SEQ ID NO:57). These polynucleotides, 
which can be called ortholog polynucleotides, encode orthologous polypeptides that can 
range in amino acid sequence identity to a reference amino acid sequence described 
2 0 herein, from about 65% to less than 1 00%, but preferably 70% to 80%, more preferably 
-80% to 90%, and still more preferably 90% to less than 100%. Orthologous 
polypeptides can also be those polypeptides that range in amino acid sequence similarity 
to a reference amino acid sequence described herein from about 75% to 100%, within 
the signature sequence. The ammo acid sequence similarity between the signature 
2 5 sequences of orthologous polypeptides is preferably 80%, more preferably 90%, and 

still more preferably, 95%. The ortholog polynucleotides encode polypeptides that have 
similar functional characteristics (e.g., fatty acid transport activity) and similar tissue 
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distribution, as appropriate to the organism from which the ortholog polynucleotides 
can be isolated. 

Ortholog polynucleotides can be isolated from (e.g. , by cloning or nucleic acid 
amplification methods) a great number of species, as shown by the sample of FATPs 
5 from evolutionary divergent species described herein (see, e.g., Figures 44A-C through 
Figure 89). Ortholog polynucleotides corresponding to those in Figure 45 (SEQ ID 
NO:47), Figure 47 (SEQ ID NO;49), Figure 49 (SEQ ID NO:51), Figure 51 (SEQ ID 
NO:53), Figures 94A and 94B (SEQ ID NO:102) and Figure 55 (SEQ ID NO:57) are 
those which can be isolated from mammals such as rat, dog, chimpanzee, monkey, 

1 0 . baboon, pig, rabbit and guinea pig, for example. 

Further variants that are fragments of the nucleic acids of the invention may be 
used to synthesize full-length nucleic acids of the invention, such as by use as primers in 
a polymerase chain reaction. As used herein, the term primer refers to a single-stranded 
oligonucleotide which acts as a point of initiation of template-directed DNA synthesis 

1 5 under appropriate conditions (e.g., in the presence of four different nucleoside 

triphosphates and an agent for polymerization, such as DNA or RNA polymerase ox- 
reverse transcriptase) in an appropriate buffer and at a suitable temperature. The 
appropriate length of a primer depends on the intended use of the primer, but typically 
ranges from 15 to 30 nucleotides. Short primer molecules generally require cooler 

2 0 temperatures to form sufficiently stable hybrid complexes with the template. A primer 
need not reflect the exact sequence of the template, but must be sufficiently 
complementary to hybridize with a template. The term primer site refers to the area of 
the target DNA to which a primer hybridizes. The term primer pair refers to a set of 
primers including a 5' (upstream) primer that hybridizes with the 5 ! end of the DNA 

2 5 sequence to be amplified and a 3' (downstream) primer that hybridizes with the 
complement of the 3' end of the sequence to be amplified. 

Further embodiments of the invention are nucleic acids that are at least 80% 
identical over their entire length to a nucleic acid described herein, for example a 
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nucleic acid having the nucleotide sequence in Figures 44A-44C (SEQ ID NO:46), 
Figures 46A-46B (SEQ ID NO:48), Figure 48 (SEQ ID NO:50) } Figures 50A-50C 
(SEQ ID NO:52), Figures 94A and 94B (SEQ ID NO: 101), and Figures 54A-54C (SEQ 
ID NO: 5 6). Additional embodiments are nucleic acids, and the complements of such 
5 nucleic acids, having at least 90% nucleotide sequence identity to the above-described 
sequences, and nucleic acids having at least 95% nucleotide sequence identity. In 
preferred embodiments, DNA of the present invention has 97% nucleotide sequence 
identity, 98% nucleotide sequence identity, or at least 99% nucleotide sequence identity 
with the DNA whose sequences are presented herein. 

1 0 Other embodiments of the invention are nucleic acids that are at least 80% 

identical in nucleotide sequence to a nucleic acid encoding a polypeptide having an 
amino acid sequence as set forth in Figure 45 (SEQ ID NO:47), Figure 47 (SEQ ID 
NO;49), Figure 49 (SEQ ID NO:51), Figure 51 (SEQ ID NO:53), Figures 94A and 94B 
(SEQ ID NO: 102) or Figure 55 (SEQ ID NO:57), or as such amino acid sequences are 

15 set forth elsewhere herein, and nucleic acids that are complementary to such nucleic 

acids. Specific embodiments are nucleic acids having at least 90% nucleotide sequence 
identity to a nucleic acid encoding a polypeptide having an amino acid sequence as 
described in the list above, nucleic acids having at least 95% sequence identity, and 
nucleic acids having at least 97% sequence identity. 

2 o The terms "complementary" or "complementarity" as used herein, refer to the 

natural binding of polynucleotides under permissive salt and temperature conditions by 
base-pairing. Complementarity between two single-stranded molecules may be 
"partial" in which only some of the nucleic acids bind, or it may be complete when total 
complementarity exists between the single-stranded molecules (that is, when A-T and 

2 5 C-C base pairing is 1 00% complete). The degree of complementarity between nucleic 
acid strands has significant effects on the efficiency and strength of hybridization 
between nucleic acid strands. This is of particular importance in amplification 
reactions, which depend on binding between nucleic acid strands. 
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' The invention further includes nucleic acids that hybridize to the above- 
described nucleic acids, especially those nucleic acids that hybridize under stringent 
hybridization conditions. "Stringent hybridization conditions" or "high stringency 
conditions" generally occur within a range from about T m minus 5 °C (5 ° C below the 
5 strand dissociation temperature or melting temperature (T m ) of the probe nucleic acid 
molecule) to about 20° C to 25 ° C below T m . As will be understood by those of skill in 
the art, the stringency of hybridization may be altered in order to identify or detect 
molecules having identical or related polynucleotide sequences. An example of high 
stringency hybridization follows. Hybridization solution is (6x SSC/1 0 mM ■ 

10 EDTA/0.5% SDS/5x Denhardt's solution/100 fig/ml sheared and denatured salmon 

sperm DNA). Hybridization is at 64-65 °C for 16 hours. The hybridized blot is washed 
two times with 2x SSC/0.5% SDS solution at room temperature for 15 minutes each, 
and two times with 0.2x SSC/0.5% SDS at 65 °C, for one hour each. Further examples 
of high stringency conditions can be found on pages 2.10.1-2.10.16 (see particularly 

15 2.10.8-1 1) and pages 6.3.1-6 in Current Protocols in Molecular Biology (Ausubel, F.M. 
et aU eds., containing supplements up through Supplement 42, 1998). Examples of 
high, medium, and low stringency conditions can be found on pages 36 and 37 of WO 
98/40404, which are incorporated herein by reference. 

The invention further relates to nucleic acids obtainable by screening an 

2 0 appropriate library with a probe having a nucleotide sequence such as that set forth in 
Figures 44A-44C (SEQ ID NO:46), Figures 46A-46B (SEQ ID NO:48), -Figure 48 
(SEQ ID NO:50), Figures 50A-50C (SEQ ID NO:52), Figures 94A and 94B (SEQ ID 
NO:101) or Figures 54A-54C (SEQ ID NO:56), or a probe which is a sufficiently long 
fragment of any of the above; and isolating the nucleic acid. Such probes generally can 

2 5 comprise at least 1 5 nucleotides. Nucleic acids obtainable by such screenings may 

include RNAs, cDNAs and genomic DNA, for example, encoding FATPs of the FATP 
family described herein. 
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Further uses for the nucleic acid molecules of the invention, whether encoding a 
full-length FATP or whether comprising a contiguous portion of a nucleic acid molecule 
such as one given in SEQ ID NO:46, 48, 50, 52, 101, or 56, include use as markers for 
tissues in which the corresponding protein is preferentially expressed (to identify 
5 constitutively expressed proteins or proteins produced at a particular stage of tissue 
differentiation or stage of development of a disease state); as molecular weight markers 
on southern gels; as chromosome markers or tags (when labeled, for example with 
biotin, a radioactive label or a fluorescent label) to identify chromosomes or to map 
related gene positions; to compare with endogenous DNA sequences in a mammal to 

1 0 identify potential genetic disorders; as probes to hybridize and thus identify, related 
DNA sequences; as a source of information to derive PCR primers for genetic 
fingerprinting; as a probe to "subtract-out 11 known sequences in the process of 
discoveiing other novel nucleic acid molecules; for selecting and making oligomers for 
attachment to a "gene chip" or other support, to be used, for example, for examination 

15 of expression patterns; to raise anti-protein antibodies using DNA immunization 

techniques; and as an antigen to raise anti-DNA antibodies or to elicit another immune 
response. 

Further methods to obtain nucleic acids encoding FATPs of the FATP family 
• include PCR and variations thereof (e.g., "RACE" PCR and semi-specific PCR 
2 0 methods). Portions of the nucleic acids having a nucleotide sequence set forth in 
Figures 44A-44C (SEQ ID NO:46), Figures 46A-46B (SEQ ID NO:48), Figure 48 
(SEQ ID NO;50), Figures 50A-50C (SEQ ID NO:52), Figures 94A and 94B (SEQ ID 
NO:101) or Figures 54A-54C (SEQ ID NO:56), (especially "flanking sequences" on 
either side of a coding region) can be used as primers in methods using the polymerase 
2 5 chain reaction, to produce DNA from an appropriate template nucleic acid. 

Once a fragment of the FATP gene is generated by PCR, it can be sequenced, 
and the sequence of the product can be compared to other DNA sequences, for example, 
by using the BLAST Network Service at the National Center for Biotechnology 
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Information. The boundaries of the open reading frame can then be identified using ' 
semi-specific PCR or other suitable methods such as library screening. Once the 5' 
initiator methionine codon and the V stop codon have been identified, a PCR product 
encoding the full-length gene can be generated using genomic DNA as a template, with 
5 primers complementary to the extreme 5' and 3* ends of the gene or to their flanking 
sequences. The full-length genes can then be cloned into expression vectors for the 
production of functional proteins. 

The invention also relates to isolated proteins or polypeptides such as those 
encoded by nucleic acids of the present invention. Isolated proteins can be purified 
1 0 from a natural source or can be made recombinantly. Proteins or polypeptides referred 
to herein as "isolated" are proteins or polypeptides that exist in a state different from the 
state in which they exist ha cells in which they are normally expressed in an organism, 
and include proteins or polypeptides obtained by methods described herein, similar- 
methods or other suitable methods, and also include essentially pure proteins oi- 
ls polypeptides, proteins or polypeptides produced by chemical synthesis or by 
combinations of biological and chemical methods, and recombinant proteins or 
polypeptides which are isolated. Thus, the term "isolated" as used herein, indicates that 
the polypeptide in question exists in a physical milieu distinct from that in which it 
occurs in nature. Thus, "isolated" includes existing in membrane fragments and vesicles 
2 0 membrane fractions, liposomes, lipid bilayers and other artificial membrane systems. 
An isolated FATP may be substantially isolated with respect to the complex cellular 
milieu in which it naturally occurs, and may even be purified essentially to 
homogeneity, for example as determined by PAGE or column chromatography (for 
example, HPLC), but may also have further cofactors or molecular stabilizers, such as 
2 5 detergents, added to the purified protein to enhance activity. In one embodiment, 

proteins or polypeptides are isolated to a state at least about 75% pure; more preferably 
at least about 85% pure, and still more preferably at least about 95% pure, as determined 
by Coomassie blue staining of proteins on SDS-polyacrylamide gels. Proteins or 
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polypeptides refeiTed to herein as "recombinant" are proteins or polypeptides produced 
by the expression of recombinant nucleic acids. 

In a preferred embodiment, an isolated polypeptide comprising a FATP, a 
functional portion thereof, or a functional equivalent of the FATP, has at least one 
5 function characteristic of a FATP, for example, transport activity, binding function (e.g., 
a domain which binds to AMP), or antigenic function (e.g., binding of antibodies that 
also bind to a naturally-occurring FATP, as that function is found in an antigenic 
determinant). Functional equivalents can have activities that are quantitatively similar 
to, greater than, or less than, the reference protein. These proteins include, for example, 
1 0 naturally occurring FATPs that can be purified from tissues in which they are produced 
(including polymorphic or allelic variants), variants (e.g., mutants) of those proteins 
and/or portions thereof. Such variants include mutants differing by the addition, 
deletion or substitution of one or more amino acid residues, or modified polypeptides in 
which one or more residues are modified, and mutants comprising one or more modified 
15 residues. Portions or fragments of a FATP can range in size from four amino acid 
residues to the entire amino acid sequence minus one amino acid. 

The isolated proteins of the invention preferably include mammalian fatty acid 
transport proteins of the FATP family of homologous proteins. In one embodiment, the 
extent of amino acid sequence similarity between a polypeptide having one of the amino 
2 0 acid sequences shown in Figure 45 (SEQ ID NO:47), Figure 47 (SEQ ID NO:49), 

Figure 49 (SEQ ID NO:51), Figure 51 (SEQ ID NO:53), Figures 94A and 94B (SEQ ID 
NO: 102), or Figure 55 (SEQ ID NO:57), and the respective functional equivalents of 
these polypeptides is at least about 88%. In other embodiments, the degree of amino 
acid sequence similarity between a FATP and its respective functional equivalent is at 
25 least about 91%, at least about 94%, or at least about 97%. 

The polypeptides of the invention also include those FATPs encoded by 
polynucleotides which are orthologous to those polynucleotides, the sequences of which 
are described herein in whole or in part. FATPs which are orthologs to those described 
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herein by amino acid sequence, in whole or in part, are, for example fatty acid transport 
proteins 1-6 of dog, rat chimpanzee, monkey, rabbit, guinea pig, baboon and pig, and 
are also embodiments of the invention. 

To determine the percent identity or similarity of two amino acid sequences or 
5 of two nucleic acid sequences, the sequences are aligned for optimal comparison 

purposes (e.g., gaps can be introduced in one or both of a first and a second amino acid 
or nucleic acid sequence for optimal alignment, and non-homologous (dissimilar) 
sequences can be disregarded for comparison purposes). In a preferred embodiment, the 
length of a reference sequence aligned for comparison purposes is at least 30%, 

1 0 preferably at least 40%, more preferably at least 50%, even more preferably at least 

60%, and even more preferably at least 70%, 80%, or 90% of the length of the reference 
sequence. The amino acid residues or nucleotides at corresponding amino acid 
positions or nucleotide positions are then compared. When a position in the first 
sequence is occupied by the same amino acid residue or nucleotide as the corresponding 

15 position in the second sequence, then the molecules are identical at that position (as 

used herein, amino acid or nucleic acid "identity" is equivalent to amino acid or nucleic 
acid "similarity"). The percent identity between the two sequences is a function of the 
number of identical positions shared by the sequences, taking into account the number 
of gaps, and the length of each gap, which need to be introduced for optimal alignment 

2 0 of the two sequences. 

The invention also encompasses polypeptides having a lower degree of identity 
but having sufficient similarity so as to perform one or more of the same functions 
performed by the polypeptides described herein by amino acid sequence. Similarity for 
* a polypeptide is- determined by conserved amino acid substitution. Such svibstitutions 

2 5 are those that substitute a given amino acid in a polypeptide by another amino acid of 
like characteristics. Conservative substitutions are likety to be pheno typically silent. 
Typically seen as conservative substitutions are the replacements, one for another, 
among the aliphatic amino acids Ala, Val, Leu, and He; interchange of the hydroxy] 
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residues Ser and Thr, exchange of the acidic residues Asp and Glu, substitution between 
the amide residues Asn and Gin, exchange of the basic residues Lys and Arg and 
replacements among the aromatic residues Phe, Tyr. Guidance concerning which amino 
acid changes axe likely to be phenotypically silent is found in Bowie et al, Science 
5 247:1306-1310(1990). 
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TABLE 1 . Conservative Amino Acid Substitutions 



Aromatic 




Phenylalanine 








Tryptophan 








Tyrosine 




Hydrophobic 




Leucine 








Isoleucine 








Valine 




Polar 




Glutamine 








Asparagine 




Basic 




Arginine 








Lysine 








Histidine 




Acidic 




Aspartic Acid 








Glutamic Acid 




Small 




Alanine 








Serine 








Threonine 








Methionine 








Glycine 





The comparison of sequences and determination of percent identity and 
similarity between two sequences can be accomplished using a mathematical algorithm. 
1 0 {Computational Molecular Biology, Lesk, A.M.,ed., Oxford University Press, New 
York, 1988; Biocomputing: Informatics and Genome Projects, Smith, D.W., ed., 
Academic Press, New York, 1993; Computer Analysis of Sequence Data, Part l t 
Griffin, A.M., and Griffin, H.G., eds., Humana Press, New Jersey, 1994; Sequence 



WO 99/36537 



PCT/US99/00182 



-34- 

Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987; and Sequence 
Analysis Primer, Gribskov, M. and Devereaux, J., eds., M. Stockton 
Press, New York, 1991). In a preferred embodiment, the percent identity between two 
amino acid sequences is deteniiined using the Needleman and Wunsch (J. MoL Biol 
5 (48);444-453 (1970)) algorithm which has been incorporated into the GAP program in 
the GCG software package (available at http://www.gcg.com), using either a Blossom 
62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 12, 10, 8, 6, or 4 and a 
length weight of 1, 2, 3, 4, 5, or 6. In yet another preferred embodiment, the percent 
identity between two nucleotide sequences is determined using the GAP program in the 

1 0 GCG software package (Devereux, J., et al, Nucleic Acids Res. 12(1):3Z1 (1984)) 

(available at http://www.gcg:com), using a NWSgapdna.CMP matrix and a gap weight 
of 40, 50, 60, 70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6. In another 
embodiment, the percent identity between two amino acid or nucleotide sequences is 
determined using the algorithm of E. Meyers and W. Miller (CABIOS. 4:11-17 (1989)) 

15 which has been incorporated into the ALIGN program (version 2.0), using a PAM120 
weight residue table, a gap length penalty of 12 and a gap penalty of 4. 

The nucleic acid and protein sequences of the present invention can further be 
used as a "query sequence" to perform a search against databases to, for example, 
identify other family members or related sequences. Such searches can be performed 

2 0 using the NBLAST and XBLAST programs (version 2.0) of Altschul, et al. (J. MoL 
Biol 275:403-10 (1990)). BLAST nucleotide searches can be performed with the 
NBLAST program, score = 100, word length = 12 to obtain nucleotide sequences 
homologous to (with calculatably significant similarity to) the nucleic acid molecules of 
the invention. BLAST protein searches can be performed with the XBLAST program, 

2 5 score = 50, word length = 3 to obtain amino acid sequences homologous to the proteins 
of the invention. To obtain gapped alignments for comparison purposes, Gapped 
BLAST can be utilized as described in Altschul et al t (Nucleic Acids Res. 2J(17):33S9- 
3402 (1997)). When utilizing BLAST and gapped BLAST programs, the default 
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parameters of the respective programs (e.g., XBLAST and NBLAST) can be used. See 
http ://wvAV.ncbi.nlm.nih. gov. 

Similarity for nucleotide and amino acid sequences can be defined in terms of 
the parameters set by the Advanced Blast search available from NCBI (the National 
5 Center for Biotechnology Information; see, for Advanced BLAST page, 

www.ncbi.nlra.nih.gov/cgi-biiT/BLAST/nph-newblast?Jform=l). These default 
parameters, recommended for a query molecule of length greater than. 85 amino acid 
residues or nucleotides have been set as follows: gap existence cost, 11, perresidue gap 
cost, 1 ; lambda ratio, 0.85. Further explanation of version 2.0 of BLAST can be found 
1 0 on related website pages and in Altschul, S .F. et aL, Nucleic Acids Res. 25:32 89-3402 
(1997). 

The invention further relates to fusion proteins, comprising a FATP or 
functional portion thereof (as described above) as a first moiety, linked to second 
moiety not occurring in the FATP as found in nature. Thus, the second moiety can be 

1 5 an amino acid, peptide or polypeptide. The first moiety can be in an N-temiinal 

location, C-tenninal location or internal to the fusion protein. In one embodiment, the 
fusion protein comprises a FATP as the first moiety, and a second moiety comprising a 
linker sequence and an affinity ligand. Fusion proteins can be produced by a variety of 
methods. For example, a fusion protein can be produced by the insertion of a FATP 

2 0 gene or portion thereof into a suitable expression vector, such as Bluescript SK +/- 
(Stratagene), pGEX-4T-2 (Pharmacia), pET-24(+) (Novagen), or vectors of similar 
construction. The resulting construct can be introduced into a suitable host cell for 
expression. Upon expression, fusion protein can be purified from cells by means of a 
suitable affinity matrix (See e.g., Current Protocols in Molecular Biology, Ausubel, 

2 5 F.M. et al. t eds., Vol. 2, pp. 16.4.1-16.7.8, containing supplements up through 
Supplement 42, 1998), 

The invention also relates to enzymatically produced, synthetically produced, or 
recombmantly produced portions of a fatty acid transport protein. Portions of a FATP 
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can be made which have full or partial function on their own, or which when mixed . 
together (though fully, partially, or nonfunctional alone), spontaneously assemble with 
one or more other polypeptides to reconstitute a functional protein having at least one 
function characteristic of a FATP. 
5 Fragments of a FATP can be produced by direct peptide synthesis, for example 

those using solid-phase techniques (Roberge, J.Y. et al, Science 2(59:202-204 (1995); 
Merrificld, J., J. Am. Chem. Soc. 55:2149-2154 (1963)). Protein synthesis can be 
performed using manual techniques or by automation. Automated synthesis can be 
carried out using, for instance, an Applied Biosystems 431 A Peptide Synthesizer 
10 (Perkin Elmer). Various fragments of a FATP can be synthesized separately and 
combined using chemical methods. / 

One aspect of the invention is a peptide or polypeptide having the amino acid 
sequence of a portion of a fatty acid transport protein which is hydrophilic rather than 
hydrophobic, and ordinarily can be detected as facing the outside of the cell membrane. 
1 5 Such a peptide or polypeptide can be thought of as being an extracellular domain of the 
FATP, or a mimetic of said extracellular domain. It is known, for example, that a 
portion of human FATP4 that includes a highly conserved motif is involved in AMP- 
CoA binding function (Stuhlsatz-Krouper, S.M. et al, 1 Biol Chem. 44:28642-28650 
(1998)). 

2 o The term "mimetic" as used herein, refers to a molecule, the structure of which 

is developed from knowledge of the structure of the FATP of interest, or one or more 
portions thereof, and, as such, is able to effect some or all of the functions of a FATP. 

Portions of an FATP can be prepared by enzymatic cleavage of the isolated 
protein, or can be made by chemical synthesis methods. Portions of a FATP can also be 

2 5 made by recombinant DNA methods in which restriction fragments, or fragments that 
may have undergone further enzymatic processing, or synthetically made DNAs are 
joined together to construct an altered FATP gene. The gene can be. made such that it 
encodes one or more desired portions of a FATP. These portions of FATP can be 
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entirely homologous to a known FATP, or can be altered in amino acid sequence 
relative to naturally occurring FATPs to enhance or introduce desired properties such as 
solubility, stability, or affinity to a ligand. A further feature of the gene can be a 
sequence encoding an N-terminal signal peptide directed to the plasma membrane. 
5 An extracellular domain can be determined by a hydrophobicity plot, such as 

those shown in Figures 28A, 29A, and 35A, or by a hydrophilicity plot such as those 
shown in Figures 28C, 29C, 35C, 91, 92 and 93. A polypeptide or peptide comprising 
all or a portion of a FATP extracellular domain can be used in a pharmaceutical 
composition. When administered to a mammal by an appropriate route, the polypeptide 

10 or peptide can bind to fatty acids and compete with the native FATPs in the membrane 
of cells, thereby malting fewer fatty acid molecules available as substrates for transport 
into cells, and reducing the amount of fatty acids taken up by, for example, the heart, in 
thecaseofFATP6. 

Another aspect of the invention relates to a method of producing a fatty acid 

15 transport protein, valiants or portions thereof, and to expression systems and host cells 
containing a vector appropriate for expression of a fatty acid transport protein. 

Cells that express a FATP, a variant or a portion thereof, or an ortholog of a 
FATP described herein by amino acid sequence, can be made and maintained in culture, 
under conditions suitable for expression, to produce protein in the cells for cell-based 

2 0 assays, or to produce protein for isolation. These cells can be procaryotic or eucaryotic. 
Examples of procaryotic cells that can be used for expression include Escherichia coli, 
Bacillus subtilis and other bacteria. Examples of eucaryotic cells that can be used for 
expression include yeasts such as Saccharomyces cerevisiae, Schizosaccharomyces 
pombe, Pichia pastoris and other lower eucaryotic cells, and cells of higher eucaryotes 

2 5 such as those from insects and mammals, such as primary cells and cell lines such as 

CHO, HeLa, 3T3 and BHK cells, preferably COS cells and human kidney 293 cells, and 
more preferably Jurkat cells. (See, e.g., Ausubel, F.M. et al 9 eds. Current Protocols in 



WO 99/36537 



PCT7US99/00182 



-38- 

Molecular Biology, Greene Publishing Associates and John Wiley & Sons, Inc., 
containing Supplements tip through Supplement 42, 1998)). 

In one embodiment, host cells that produce a recombinant FATP, or a portion 
thereof, a variant, or an ortholog of a FATP described herein by amino acid sequence, 
5 can be made as follows. A gene encoding a FATP, variant or a portion thereof can be 
inserted into a nucleic acid vector, e.g., a DNA vector, such as a plasmid, phage, 
cosmid, phagemid, virus, virus-derived vector (e.g., SV40, vaccinia, adenovirus, fowl 
pox virus, pseudorabies viruses, retroviruses) or other suitable replicon, which can be 
present in a single copy or multiple copies, or the gene can be integrated in a host cell 
10 clxromosome. A suitable replicon or integrated gene can contaiti all or part of the 
coding sequence for a FATP or variant, operably linked to one or more expression 
control regions whei*eby the coding sequence is under the control of transcription 
signals and linked to appropriate translation signals to permit translation. The vector 
can be introduced into cells by a method appropriate to the type of host cells (e.g., 
15 transfection, electroporation, infection). For expression from the FATP gene, the host 
cells can be maintained under appropriate conditions (e.g., in fee presence of inducer, 
normal, growth conditions, etc.). Proteins or polypeptides thus produced can be 
recovered (e.g., from the cells, as in a membrane fraction, from the periplasmic space of 
bacteria, from culture medium) using suitable techniques. Appropriate membrane 
2 0 targeting signals may be incorporated into the expressed polypeptide. These signals 
may be endogenous to the polypeptide or they may be heterologous signals. 

Polypeptides of the invention can be recovered and purified from cell cultures 
(or from their primary cell source) by well-known methods including ammonium sulfate 
or ethanol precipitation, acid extraction, anion or cation exchange chromatography, 
2 5 phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxylapatite chromatography and high performance liquid 
chromatography. Known methods for refolding protein can be used to regenerate active 
conformation if the polypeptide is denatured during isolation or purification. 
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In a further aspect of the invention are methods for assessing the transport 
function of any of the fatty acid transport proteins or polypeptides described herein, 
including orthologs, and in variations of these, methods for identifying an inhibitor (or 
an enhancer) of such function and methods for assessing the transport function in the 
5 presence of a candidate inhibitor or a known inhibitor, 

A variety of systems comprising living cells can be used for these methods. 
Cells to be used in fatty acid transport assays, and .further in methods for identifying an 
inhibitor or enhancer of this function, express one or more FATPs. See Examples 3, 6, 
9, 12 and 14 for data on tissue distribution of expression of FATPs, and Examples 10 

10 and 1 1 describing recombinant cells expressing FATP. Cells for use in cell-based 

assays described herein can be drawn from a variety of sources, such as isolated primary 
cells of various organs and tissues wherein one or more FATPs are naturally expressed. 
In some cases, the cells can be from adult organs, and in some cases, from embryonic or 
fetal organs, such as heart, lung, liver, intestine, skeletal muscle, kidney and the like. 

15 Cells for this purpose can also include cells cultured as fragments of organs or in 
conditions simulating the cell type and/or tissue organization of organs, in which 
artificial materials may be used as substrates for cell growth. Other types of cells 
suitable for this purpose include cells of a cell strain or cell line (ordinarily comprising 
cells considered to be "transformed") transfected to express one or more FATPs. 

2 0 A further embodiment of the invention is a method for detecting, in a sample of 

cells, a fatty acid transport protein, a portion or fragment thereof, a fusion protein 
comprising a FATP or a portion thereof, or an ortholog as described herein, wherein tire 
cells can be, for instance, cells of a tissue, primary culture cells, or cells of a cell line, 
including cells into which nucleic acid has been introduced. The method comprises 

2 5 adding to the sample an agent that specifically binds to the protein, and detecting the 
agent specifically bound to the protein. Appropriate washing steps can be added to 
reduce nonspecific binding to the agent. The agent can be, for example, an antibody, a 
ligand or a substrate mimic. The agent can have incorporated into it, or have bound to 
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it, covalently or by high affinity non-covalent interactions, for instance, a label that 
facilitates detection of the agent to which it is bound, wherein the label can be, but is not 
limited to, a phosphorescent label, a fluorescent label, abiotin or avidin label, or a 
radioactive label. The means of detection of a fatty acid transport protein can vary, as 
5 appropriate to the agent and label used. For example, for an antibody that binds to the 
fatty acid transport protein, the means of detection may call for binding a second 
antibody, which has been conjugated to an enzyme, to the antibody which binds the 
fatty acid transport protein, and detecting the presence of the second antibody by means 
of the enzymatic activity of the conjugated enzyme. 

1 o Similar principles can also be applied to a cell lysate or a more purified 

preparation of proteins from cells that may comprise a fatty acid transport protein of 
interest, for example in the methods of immunoprecipitation, immunoblotting, 
immunoaffmity methods, that in addition to detection of the particular FATP, can also 
be used in purification steps, and qualitative and quantitative immunoassays. See, for 
1 5 instance, chapters 1 1- through 14 in Antibodies: A Laboratory* Manual, E. Harlow and 
D. Lane, eds., Cold Spring Harbor Laboratory, 1988. 

Isolated fatty acid transport protein or, an antigenically similar portion thereof, 
especially a portion that is soluble, can be used in a method to select and identify 
molecules which bind specifically to the FATP. Fusion proteins comprising all of, or a 

2 0 portion of, the fatty acid transport protein linked to a second moiety not occurring in the 

FATP as found in nature, can be prepared for use in another embodiment of the method. 
Suitable fusion proteins for this purpose include those in which the second moiety 
comprises an affinity ligand (e.g., an enzyme, antigen, epitope). FATP fusion proteins 
can be produced by the insertion of a gene encoding the FATP or a variant thereof, or a 
2 5 suitable portion of such gene into a suitable expression vector, which encodes an 

affinity ligand (e.g., pGEX-4T-2 andpET-15b, encoding glutathione S-transferase and 
His-Tag affinity ligands, respectively). The expression vector can be introduced into a 
suitable host cell for expression. Host cells are lysed and the lysate, containing fusion 
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protein/can be bound to a suitable affinity matrix by contacting the lysate with an 
affinity matrix. 

In one embodiment, the fusion protein can be immobilized on a suitable affinity 
matrix under conditions sufficient to bind the affinity ligand portion of the fusion 
5 protein to the matrix, and is contacted with one or more candidate binding agents (e"g., a 
mixture of peptides) to be tested, under conditions suitable for binding of the binding 
agents to the FATP portion of the bound fusion protein. Next, the affinity matrix with 
bound fusion protein can be washed with a suitable wash buffer to remove unbound 
candidate binding agents and non-specifically bound candidate binding agents. Those 

1 0 agents which remain bound can be released by contacting the affinity matrix with fusion 
protein bound thereto with a suitable elution buffer. Wash buffer can be formulated to 
permit binding of the fusion protein to the affinity matrix, without significantly 
disrupting binding of specifically bound binding agents. In this aspect, elution buffer 
can be formulated to permit retention of the fusion protein by the affinity matrix, but 

15 can be formulated to interfere with binding of the candidate binding agents to the target 
portion of the fusion protein. For example, a change in the ionic strength or pH of the 
elution buffer can lead to release of specifically bound agent, or the elution buffer can 
comprise a release component or components designed to disrupt binding of specifically 
bound agent to the target portion of the fusion protein. 

2 0 Immobilization can be performed prior to, simultaneous with, or after, 

contacting the fusion protein with candidate binding agent, as appropriate. Various 
permutations of the m ethod are possible, depending upon factors such as the candidate 
molecules tested, the affinity matrix-ligand pair selected, and elution buffer formulation. 
For example, after, the wash step, fusion protein with binding agent molecules bound 

2 5 thereto can be eluted from the affinity matrix with a suitable elution buffer (a matrix 
elution buffer, such as glutathione for a GST fusion). Where the fusion protein 
comprises a cleavable linker, such as a thrombin cleavage site, cleavage from the 
affinity ligand can release a portion of the fusion with the candidate agent bound 
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tliereto. Bound agent molecules can then be released from the fusion protein or its 
cleavage product by an appropriate method, such as extraction. 

One or more candidate binding agents can be tested simultaneously. Where a 
mixture of candidate binding agents is tested, those found to bind by the foregoing 
5 processes can be separated (as appropriate) and identified by suitable methods (e.g., 
PCR, sequencing, chromatography). Large libraries of candidate binding agents (e.g., 
peptides, RNA oligonucleotides) produced by combinatorial chemical synthesis or by 
other methods can be tested (see e.g., Ohlmeyer, M.HJ. et aL, Proc. Natl Acad, ScL 
USA 10922-1 0926 (1993) andDeWitt, S.H. et aL, Proa Natl Acad. ScL USA 

10 90:6909-6913 (1993), relating to tagged compounds; see also Rutter, W.J. et aL U.S. 
Patent No. 5,010,175; Huebner, V.D. a/.,U.S. Patent No. 5,182,366; and Geysen, 
H.M., U.S. Patent No. 4,833,092). Random sequence RNA libraries (see Ellington, 
A.D. et aL, Nature 346:818-822 (1990); Bock, L.C. et al, Nature 355:584-566 (1992); 
and Szostak, J.W., Trends in Biochem. ScL 77:89-93 (March, 1992)) can also be 

1 5 screened according to the present method to select RNA molecules which bind to a 
target FATP or FATP fusion protein. Where binding agents selected from a 
combinatorial libraiy by the present method carry unique tags, identification of 
individual biomolecules by chromatographic methods is possible. Where binding 
agents do not carry tags, chromatographic separation, followed by mass spectrometry to 

2 O ascertain structure, can be used to identify binding agents selected by the method, for 
example. 

The invention also comprises a method for identifying an agent which inhibits 
interaction between a fatty acid transport protein (e.g., one comprising the amino acid 
sequence in SEQ ID NO:47, SEQ ID NO:49, SEQ ID NO:51, SEQ ID NO:53, SEQ ID 
2 5 NO: 102, or SEQ ID NO:57), and a ligand of said protein. The FATP can be one 
described by amino acid sequence herein, a portion or fragment thereof, a variant 
thereof, or an oftholog thereof, or a FATP fusion protein. Here, a ligand can be, for 
instance, a substrate, or a substrate mimic, an antibody, or a compound, such as a 
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peptide, that binds with specificity to a site on the protein. The method comprises 
combining, not limited to a particular order, the fatty acid protein, the ligand of the 
protein, and a candidate agent to be assessed for its ability to inhibit interaction between 
the protein and the ligand, under conditions appropriate for interaction between the 
5 protein and the ligand (e.g., pH, salt, temperature conditions conducive to appropriate 
conformation and molecular interactions); determining the extent to which the protein 
and ligand interact; and comparing (1) the extent of protein-ligand interaction in the 
presence of candidate agent with (2) the extent of protein-ligand interaction in the 
absence of candidate agent, wherein if (1) is less than (2), then the candidate agent is 
1 0 one which inhibits interaction between the "protein and the ligand. 

The method can be facilitated, for example, by using an experimental system 
which employs a solid support (column chromatography matrix, wall of a plate, 
microtiter wells, column pore glass, pins to be submerged in a solution, beads, etc.) to 
which the protein can be attached. Accordingly, in one embodiment, the protein can be 
15 fixed to a solid phase directly or indirectly, by a linker. The candidate agent to be tested 
is added under conditions conducive for interaction and binding to the protein. The 
ligand is added to the solid phase system under conditions appropriate for binding. 
Excess ligand is removed, as by a series of washes done under conditions that do not 
disrupt protein-ligand interactions. Detection of bound ligand can be facilitated by 
2 0 using a ligand that carries a label (e.g., fluorescent, chemiluminescent, radioactive). In a 
control experiment, protein and ligand are allowed to interact in the absence of any 
candidate agent, under conditions otherwise identical to those used for the "test" 
conditions where candidate inhibiting agent is present, and any washes used in the test 
conditions are also used in the control. The extent to which ligand binds to the protein 
2 5 in the presence of candidate agent is compared to the extent to which ligand binds to the 
protein in the absence of tire candidate agent. If the extent to which interaction of the 
protein and the ligand occurs is less in tire presence of the candidate agent than in the 
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absence of the candidate agent, the candidate agent is an agent which inhibits interaction 
between the protein and the ligand of the protein. 

In a further embodiment, an inhibitor (or an enhancer) of a fatty acid transport 
protein can be identified. The method comprises steps which are, or are variations of 
5 the following: contacting the cells with fatty acid, wherein the fatty acid can be labeled 
for convenience of detection; contacting a first aliquot of the cells with an agent being 
tested as an inhibitor (or enhancer) of fatty acid uptake while maintaining a second 
aliquot of cells under the same conditions but without contact with the agent; and 
measuring (e.g., quantitating) fatty acid in the first and second aliquots of cells; wherein 
10 a lesser quantity of fatty acid in the first aliquot compared to that in the second aliquot 
is indicative that the agent is an inhibitor of fatty acid uptake by a fatty acid transport 
protein. A greater quantity of fatty acid in the first aliquot compared to f hat in the 
second aliquot is indicative that the agent is an enhancer of fatty acid uptake by a fatty 
acid transport protein. 

15 A particular embodiment of identifying an inhibitor or enhancer of fatty acid 

transport function employs the above steps, but also employs additional steps preceding 
those given above: introducing into cells of a cell strain or cell line ("host cells' 1 for the 
intended introduction of, or after the introduction of, a vector) a vector comprising a 
fatty acid transport protein gene, wherein expression of the gene can be regulatable or 

2 0 constitutive, and providing conditions to the host cells under which expression of the 
gene can occur. 

The terms "contacting" and "combining" as used herein in the context of 
bringing molecules into close proximity to each other, can be accomplished by 
conventional means. For example, when referring to molecules that are soluble, 
2 5 contacting is achieved by adding the molecules together in a solution. "Contacting" can 
also be adding an agent to a test system, such as a vessel containing cells in tissue 
culture. 
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The term "inhibitor" or "antagonist", as used herein, refers to an agent which . 
blocks, diminishes, inhibits, hinders, limits, decreases, reduces, restricts or interferes 
with fatty acid transport into the cytoplasm of a cell, or alternatively and additionally, 
prevents or impedes the cellular effects associated with fatty acid transport. The term 
5 "enhancer" or "agonist", as used herein, refers to an agent which augments, enhances, or 
increases fatty acid transport into the cytoplasm of a cell. An antagonist will decrease 
fatty acid concentration, fatty acid metabolism and byproduct levels in the cell, leading 
to phenotypic and molecular changes. 

In order to produce a "host cell" type suitable for fatty acid uptake assays and for 
1 0 assays derived therefrom for identifying inhibitors or enhancers thereof, a nucleic acid 
' vector can be constructed to comprise a gene encoding a fatty acid transport protein, for 
example, human FATP1, FATP2, FATP3, FATP4, FATP5, FATP6, a mutant or variant 
thereof, an ortholog of the human proteins, such as mouse orthologs or orthologs found 
in other mammals, or a FATP family protein of origin m an organism other than a 
15 mammal. The gene of the vector can be regulatable, such as by the placement of the 
gene under the control of an inducible or repressible promoter in the vector (e.g., 
inducible or repressible by a change in growth conditions of the host cell harboring the 
vector, such as addition of inducer, binding or functional removal of repressor from the 
cell millieu, or change in temperature) such that expression of the FATP gene can be 
2 0 turned on or initiated by causing a change in growth conditions, thereby causing the 
protein encoded by the gene to be produced, in host cells comprising the vector, as a 
plasma membrane protein. Alternatively, the FATP gene can be constitutive! y 
expressed. 

A vector comprising an FATP gene, such as a vector described herein, can be 
2 5 introduced into host cells by a means appropriate to the vector and to the host cell type. 
For example, commonly used methods such as electroporation, transfection, for 
instance, transfection using CaCl 2 , and transduction (as for a virus or bacteriophage) can 
be used. Host cells can be, for example, mammalian cells such as primary culture cells 
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or cells of cell lines such as COS cells, 293 cells or Jurkat cells. Host cells can also be, 
in some cases, cells derived fironi insects, cells of insect cell lines, bacterial cells, such 
as E. coii, or yeast cells, such as S. cerevisiae. It is preferred that the fatty acid transport 
protein whose function is to be assessed, with .or without a candidate inhibitor or 
5 enhancer, be produced in host cells whose ancestor cells originated in a species related 
to the species of origin of the FATP gene encoding the fatty acid transport protein. For 
example, it is preferable that tests of function or of inhibition or enhancement of a 
mammalian FATP be canied out in host mammalian cells producing the FATP, rather 
than bacterial cells or yeast cells. 

1 o Host cells comprising a vector comprising a regulatable FATP gene can be 

treated so as to allow expression of the FATP gene and production of the encoded 
protein (e.g., by contacting the cells with an inducer compound that effects transcription 
from an inducible promoter operably linked to the FATP gene). 

The test agent (e.g., an agonist or antagonist) is added to the cells to be used in a 
15 fatty acid transport assay, in the presence or absence of test agent, under conditions 
suitable for production and/or maintenance of the expressed FATP in a conformation 
appropriate for association of the FATP with test agent and substrate. For example, 
conditions imder which an agent is assessed, such as media and temperature 
requirements, can, initially, be similar to those necessary for transport of typical fatty 

2 0 acid substrates across the plasma membrane, One of ordinary skill in the art will know 

how to vary experimental conditions depending upon the biochemical nature of the test 
agent. The test agent can be added to the cells in the presence of fatty acid, or in the 
absence of fatty acid substrate, with the fatty acid substrate being added following tire 
addition of the test agent. The concentration at which the test agent can be evaluated 
2 5 can be varied, as appropriate, to test for an increased effect with increasing 
concentrations. 

Test agents to be assessed for their effects on fatty acid transport can be any 
chemical (element, molecule, compound), made -synthetically, made by recombinant 
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techniques or isolated from a natural source. For example, test agents can be peptides, 
polypeptides, peptoids, sugars, hormones, or nucleic acid molecules, such as antisense 
nucleic acid molecules. In addition, test agents can be small molecules or molecules of 
greater complexity made by combinatorial chemistry, for example, and compiled into 
5 libraries. These libraries can comprise, for example, alcohols, allcyl halides, amines, 
amides, esters, aldehydes, ethers and other classes of organic compounds. Test agents 
can also be natural or genetically engineered products isolated from lysates of cells, 
bacterial, animal or plant, or can be the cell lysates themselves. Presentation of test 
compounds to the test system can be in either an isolated form or as mixtures of 

10 compounds, especially in initial screening steps. 

Thus, the invention relates to a method for identifying agents which alter fatty 
acid transport, the method comprising providing the test agent to the cell (-wherein "cell" 
includes the plural, and can include cells of a cell strain, cell line or culture of primary 
cells or organ culture, for example), under conditions suitable for binding to its target, 

15 whether to the FATP itself or to another target on or in the cell, wherein the transformed 
cell comprises a FATP. 

In greater detail, to test one or more agents or compounds (e.g., a mixture of 
compounds can conveniently be screened initially) for inhibition of the transport 
function of a fatty acid transport protein, the agent(s) can be contacted with the cells. 

2 0 The cells can be contacted with a labeled fatty acid. The fatty acid can be, for example, 
a known substrate of tire fatty acid transport protein such as oleate or palmitate. The 
fatty acid can itself be labeled with a radioactive isotope, (e.g., 3 H or 14 C) or can have a 
radioactively labeled adduct attached. In other variations, the fatty acid can have 
chemically attached. to it a fluorescent label, or a substrate for an enzyme occurring 

2 5 within the cells, wherein the substrate yields a detectable product, such as a highly 

colored or fluorescent product. Addition of candidate inhibitors and labeled substrate to 
the cells comprising fatty acid transport protein can be in either order or can be 
simultaneous. 
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A second aliquot of cells, which can be called "control" cells (a "first" aliquot of 
cells can be called "test" cells), is treated, if necessary (as in the case of transformed 
, 'host"cells), so as to allow expression of the FATP gene, and is contacted with the 
labeled substrate of the fatty acid transport protein. The second aliquot of cells is not 
5 contacted with one or more agents to be tested for inhibition of the transport function of 
the protein produced in the cells, but is otherwise kept under the same culture conditions 
as the first aliquot of cells. 

In a further step of a method to identify inhibitors of a fatty acid transport 
protein, the labeled fatty acid is measured in the first and second aliquots of cells. A 

1 o preliminary step of this measurement process can be to separate the external medium 

from the cells so as to be able to distinguish the4abeled fatty acid external to the cells 
from that which has been transported inside the cells. This can be accomplished, for 
instance, by removing the cells from their growth container, centrifuging the cell 
suspension, removing the supernatant and performing one or more wash steps to 
1 5 extensively dilute the remaining medium which may contain labeled fatty acid. 

Detection of the labeled fatty acid can be by a means appropriate to the label used. For 
example, for a radioactive label, detection can be by scintillation counting of 
appropriately prepared samples of cells (e.g., lysates or protein extracts); for a 
fluorescent label, by measuring fluorescence in the cells by appropriate instrumentation. 

2 0 If a compound tested as a candidate inhibitor of transport function causes the test 

cells to have less labeled fatty acid detected hi the cells than that detected in the control 
cells, then the compound is an inhibitor of the fatty acid transport protein. Procedures 
analogous to those above can be devised for identifying enhancers (agonists of FATP s) 
of fatty acid transport function wherein if the test cells contain more labeled fatty acid 
2 5 than that detected in the control cells, or if the fatty acid is taken up at a higher rate, then 
the compound being tested can be concluded to be an enhancer of the fatty acid 
transport protein. 
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Exaniple 13 describes use of an assay of this type to identify an inhibitor of a 
FATP. In Example 13, an antisense oligonucleotide which specifically inhibits 
biosynthesis of mmFATP4 was demonstrated to inhibit fatty acid uptake into mouse 
enterocytes. Similarly, antisense oligonucleotides directed towards specifically 
5 inhibiting the biosynthesis of FATP6 in heart cells, FATP5 in liver cells, FATP3 in lung 
cells, and FATP2 in colon cells, can be demonstrated as examples of "test agents' 1 that 
inhibit fatty acid transport. 

Another assay to determine whether an-agent is an inhibitor (or enhancer) of 
fatty acid transport employs animals, one or more of which are administered the agent, 

1 0 and one or more of which are maintained under similar conditions, but are not 

administered the agent. Both groups of animals are given fatty acids (e.g., orally, 
intravenously, by tube inserted into stomach or intestine), and the fatty acids taken up 
into a bodily fluid (e.g., serum) or into an organ or tissue of interest are measured from 
comparable samples taken from each group of animals. The fatty acids may carry a 

15 label (e.g., radioactive) to facilitate detection and quantitation of fatty acids taken up 
into the fluid or tissue being sampled. This type of assay can be used alone or can be 
used in addition to in vitro assays of a candidate inhibitor or enhancer. 

An agent determined to be an inhibitor (or enhancer) of FATP function, such as 
fatty acid binding and/or fatty acid uptake, can be administered to cells in culture, or in 

2 0 vivo, to a mammal (e.g. human) to inhibit (or enhance) FATP function. Such an agent 
may be one that acts directly on the FATP (for example, by binding) or can act on an 
intermediate in a biosynthetic pathway to produce FATP, such as transcription of the 
FATP gene, processing of the mRNA, or translation of the mRNA. An example of such 
an agent is antisense oligonucleotide. 

2 5 Antisense methods similar to those illustrated in Example 13 can be used to 

determine the target FATP of a compound or agent that has an inhibitory or enhancing 
effect on fatty acid uptake. For example, antisense oligonucleotide directed to the 
inhibition of FATP4 biosynthesis can be added to lung cells or cell lines derived from 
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lung cells. In addition, antisense oligonucleotides directed to the inhibition of other 
FATPs, except for FATP3, can also be added to the lung cells. The administration of 
antisense oligonucleotides in this manner ensures that the predominant FATP activity 
remaining in the cells comes from FATP3. After a period of incubation of the cells with 
5 the antisense oligonucleotides sufficient to deplete the plasma membrane of the FATPs 
whose biosynthesis has been inhibited, a test agent, preferably one that has been shown 
by some preliminary test to have an inhibitory or enhancing activity on fatty acid 
transport, can. be added to the lung cells. If the test agent is now demonstrated, after 
treatment of the cells with antisense oligonucleotides, to have an inhibitory or 
10 enhancing activity on fatty acid transport in the lung cells, it can be concluded that the 
target of the test agent is FATP3, or a molecule involved in the biosynthesis or activity 
of FATP3. 

hi another type of cell-based assay for uptake of fatty acids, a change of 
intracellular pH resulting from the uptake of fatty acids can be followed by an indicator 

1 5 fluorophore. The fluorophore can be taken up by the cells in a preincubation step. Fatty 
acids caii be added to the cell medium, and after some period of incubation to allow 
FATP -mediated uptake of fatty acids, the change in A max of fluorescence can be 
measured, as air indicator of a change in intracellular pH, as the X max of fluorescence of 
the fluorophore changes with the pH of its environment, thereby indicating uptake of 

2 0 fatty acids. One such fluorophore is BCECF (2', 7'-bis(2-carboxyethyl)-5(6)- 
carboxyfluorescein; Rink, T.J. etal, J. Cell. Biol 95: 189 (1982)). 

In assays similar to those described above, a candidate inhibitor or enhancer of 
fatty acid transport function can be added (or mock-added, for control, cultures)to 
cultures of cells engineered to express a desired FATP to which fatty acid substrate is 

2 5 also added. Inhibition of fatty acid uptake is indicated by a lack of -the drop in pH, 
indicating fatty acid uptake, that is seen in control cells. Enhancement of fatty acid 
uptake is indicated by a decrease in intracellular pH, as compared to control cells not 
receiving the candidate enhancer of fatty acid transport function. 
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Yeast cells can be used in a similar cell-based assay for the uptake of fatty acids 
mediated by a FATP, and such an assay can be adapted to a screening assay for the 
identification of agents that inhibit or enhance fatty acid uptake by an FATP. Yeast 
cells lacking an endogenous FATP activity (mutated, disrupted or deleted for FAT J; 
5 Faergeman, NJ. et at., J. Biol. Chem. 272(13):8531-8538 (1997); Wafldns, P.A. et al, 
J. Biol Chem. 273(29):18210~18219 (1998)) can be engineered to harbor a related gene 
of the family of FATP-encoding genes, such as a mammalian FATP (e.g., human 
FATP4). 

Examples of expression vectors include pEG (Mitchell, D.A., et al, Yeast 9:115- 
10 723 (1993)) andpDADl andpDAD2, which contain a GAL1 promoter (Davis, L. L and 
Fink, G. rT, Cell 57:965-978 (1990)). A variety of promoters are suitable for 
expression. Available yeast vectors offer a choice of promoters. In one embodiment, 
the inducible GAL1 promoter is used. In another embodiment, the constitutive ADH1 
promoter (alcohol dehyrodrogenase; Bennetzen, J. L. and Hall, B. D., J. Biol Chem. 
1 5 257:3026-303 1 (1982)) can be used to express an inserted gene on glucose-containing 
media. An example of a vector suitable for expression of a heterologous FATP gene in 
yeast is pQB 169. 

With the introduced FATP gene providing the only fatty acid transport protein 
function for the yeast cells, it is possible to study effect of the heterologous FATP on 

2 0 fatty acid transport into the yeast cells in isolation. Assays for the uptake of fatty acids 
into the yeast cells can be devised that are similar to those described above and/or those 
assays that have been illustrated in the Examples. Tests for candidate inhibitors or 
• enhancers of the heterologous FATP can be done in cultures of yeast cells, wherein the 
yeast cells are incubated with fatty acid substrate and an agent to be tested as an 

2 5 inhibitor or enhancer of FATP function. FATP uptake after a period of time can be 
measured by analyzing the contents of the yeast cells for fatty acid substrate, as 
compared with control yeast cells incubated with the fatty acid, but not with the test 
agent. Yeast cells have the additional advantage, over mammalian cells in culture, for 



\VO 99/36537 



PCT/US99/00182 



-52- 

example, that yeast cells can be forced to rely upon fatty acids as their only source of 
carbon, if the growth medium supplied to the yeast cells is formulated to contain no 
other source of carbon. Thus, the effect of the heterologous FATP on fatty acid uptake 
and metabolism in the engineered yeast cells can be amplified. An agent that efficiently 
5 blocks transport function of the heterologous FATP could result in death of the yeast 
cells. Thus, in this case, inhibition of function of the heterologous FATP can result in 
loss of viability. A simple measure of viability is turbidity of the yeast suspension 
culture, which can be adapted to a high throughput screening assay for effects of various 
agents to be tested, using microtiter plates or similar devices for small- volume cultures 

10 of the engineered yeast cells. 

Cell- free assays can also be used to measure the transport of fatty acids across a 
membrane, and therefor also to assess a test treatment or test agent for its effect on the 
rate or extent of fatty acid transport. An isolated FATP, for example in the presence of 
a detergent that preserves the native 3-dimensional structure of the FATP, or partially 

15 purified FATP, can be used in an artificial membrane system typically used to preserve 
the native confonnation and activity of membrane proteins. Such systems include 
liposomes, artificial bilayers of phospholipids, isolated plasma membrane such as cell 
membrane fragments, cell membrane fractions, or cell membrane vesicles, and other 
systems in which the FATP can be properly oriented within the membrane to have 

2 0 transport activity. Assays for transport activity can be performed using methods 

analogous to those that can be used in cells engineered to predominantly express one 
FATP whose function is to be measured. A labeled (e.g., radioactively labeled) fatty 
acid substrate can be incubated with one side of a bilayer or in a suspension of 
liposomes constructed to integrate a properly oriented FATP. The accumulation of fatty 

2 5 acids with time can be measured, using appropriate means to detect the label (e.g., 
scintillation counting of medium on each side of the bilayer, or of the contents of 
liposomes isolated from the surrounding medium). Assays such as these can be adapted 
to use for the testing of agents which might interact with the FATP to produce an 
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inhibitory or an enhancing effect on the rate or extent of fatty acid transport. That is, the 
above-described assay can be done in the presence or absence of the agent to be tested, 
and the results compared. 

For examples of isolation of membrane proteins (ADP/ATP carrier and 
5 uncoupling protein), reconstitution into phospholipid vesicles, and assays of transport, 
see Klingenberg, M. et al t Methods EnzymoL 250:369-389 (1995). For an example of a 
membrane protein (phosphate carrier of Saccharomyces cerevisiae) that was purified 
and solubilized from£ coli inclusion bodies, see Schroer, A. et aL, J. Biol. Chem. 273: 
14269-14276 (1998). The Glutl glucose transporter of rat has been expressed in yeast. 

10 A crude membrane fraction of the yeast was prepared and reconstituted with soybean 
phospholipids into liposomes. Glucose transport activity could be measured in the 
liposomes (Kasahara, T. and Kasahara, M. ? J. Biol Chem. 273: 29113-29117 (1998)). 
Similar methods can b.e applied to the proteins and polypeptides of the invention. 
Another embodiment of the invention is a method for inhibiting fatty acid 

15 uptake in a mammal (e.g., a human), comprising administering to the mammal a 

therapeutically effective amount of an inhibitor of the transport function of one or more 
of the fatty acid transport proteins, thereby decreasing fatty acid uptake by cells 
comprising the fatty acid protein(s). Where it is desirable to reduce the uptake of fatty 
acids, for example, in the treatment of chronic obesity or as a part of a program of 

2 0 weight control or hyperlipidemia control in a human, one or more inhibitors of one or 
more of the fatty acid transport proteins can be administered in an effective dose, and by 
an effective route, for example, orally, or by an indwelling device that can deliver doses 
to the small intestine. The inhibitor can be one identified by methods described herein, 
or can be one that is, for instance, structurally related to an inhibitor identified by 

2 5 methods described herein (e.g., having chemical adducts to better stabilize or solubilize 
the inhibitor). The invention further relates to compositions comprising inhibitors of 
fatty acid uptake in a mammal, which may further comprise phamiaceutical carriers 
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suitable for administration to a subject mammal, such as sterile solubilizing or 
emulsifying agents. 

A further embodiment of the present invention is a method of enhancing or 
increasing fatty acid uptake, such as enhancing or increasing LCFA uptake in the small 
5 intestine (e.g., to treat or prevent a malabsorption syndrome or other wasting condition) 
or in the liver (e.g., by an enhancer of FATP5 transport activity to treat acute liver 
failure) or in the kidney (e.g., by an enhancer of FATP2 transport activity to treat kidney 
failure). In this embodiment, a therapeutically effective amount of an enhancer of the 
transport function of one or more of the fatty acid transport proteins can be administered 
10 to a mammalian subject, with the result that fatty acid uptake in the small intestine is 
enhanced, ha this embodiment, one or more enhancers of one or more of fatty acid 
transport proteins is administered in an effective dose and by a route (e.g., orally or by a 
device, such as an indwelling catheter or other device) which can deliver doses to the 
gut, The enhancer of F ATP function (e.g., an enhancer of FATP4 function) can be 
15 identified by methods described herein or can be one that is structurally similar to an 
enhancer identified by methods described herein. 

Aerobic reperfusion of ischemic myocardium is a common clinical event which 
can occur during such treatments as cardiac surgery, angioplasty, and thrombolytic 
therapy after a myocardial infarction. During reperfusion, a rapid recovery of 
2 0 myocardial energy production is essential for the complete recovery of contractile 

function. Not only the extent of recovery of myocardial energy metabolism but also the 
type of energy substrate used by the heart during reperfusion are important determinants 
of functional recovery. Circulating fatty acid levels increase following acute 
myocardial infarction or during cardiac surgery, such that during and following 
2 5 ischemia the heart muscle can be exposed to very high concentrations of fatty acids 
(Lopaschuk, G.D. and W. C. Stanley, Science and Medicine (November/December 
1997)). High plasma fatty acid concentrations increase the severity of ischemic damage 
in a number of experimental models of cardiac ischemia and have been linked to 
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depression of mechanical function during aerobic reperfusion of previously ischemic 
hearts. Further data show that modifying fatty acid utilization can be beneficial for 
heart function in ischemia and can be a useful approach for the treatment of angina. 
See, e.g., Desideri and Celegon, Am. J. Cardiol 52f5^):50K-53K; Lopaschuk, Am. J. 
5 Cardiol 82 (5 A) :14K-17K. Plasma fatty acid concentrations can be reduced by 

administering to a human subject or other mammal an effective amount of an inhibitor 
of a FATP such as FATP2 or FATP4, thereby providing a way of reducing fatty acid 
utilization by the heart. 

In a further embodiment of the invention, a therapeutically effective amount of 

10 an inhibitor of hsFATP6 can be administered to a human patient by a suitable route, to 
reduce the uptake of fatty acids by cardiac muscle. This treatment is desirable in 
patients who are diagnosed as having, or who are at risk of, abnormal accumulations of 
fatty acids in the heart or a detrimentally high rate of uptake of fatty acids into the heart, 
because of ischemic heart disease, or following ischemia or trauma to the heart. 

15 The invention further relates to antibodies that bind to an isolated or 

recombinant fatty acid transport protein of the FATP family, including portions of 
antibodies, which can specifically recognize and bind to one or more FATPs. The 
antibodies and portions thereof of the invention include those which bind to one or more 
FATPs of mouse or other mammalian species. In a preferred embodiment, the 

2 0 antibodies specifically bind to a naturally occurring FATP of humans. The antibodies 
can be used in methods to detect or to purify a protein of the present invention or a 
portion thereof by various methods of immunoaffmity chromatography, to inhibit the 
function of a protein in a method of therapy, or to selectively inactivate an active site, or 
to study other aspects of the structure of these proteins, for example. 

2 5 The antibodies of the present invention can be polyclonal or monoclonal. The 

term antibody is intended to encompass both polyclonal and monoclonal antibodies. 
Antibodies of the present invention can be raised against an appropriate immunogen, 
including proteins or polypeptides of the present invention, such as an isolated or 
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recombinant FATP1, FATP2, FATP3, FATP4, FATP5, FATP6, mtFATP, ceFATPa, ' 
ceFATPb, scFATP or portions thereof, or synthetic molecules, such as synthetic 
peptides (e.g., conjugated to a suitable carrier). Preferred embodiments are antibodies 
that bind to any of the following: hsFATPl, hsFATP2, hsFATP3, hsFATP4, hsFATPS 
5 or hsFATP6. The immunogen can be a polypeptide comprising a portion of a FATP 
and having at least one function of a fatty acid transport protein, as described herein. 

The term antibody is also intended to encompass single chain antibodies, 
chimeric, humanized or primatized (CDR-grafted) antibodies and the like, as well as 
chimeric or CDR-grafted single chain antibodies, comprising portions from more than 
1 0 one species. For example, the chimeric antibodies can comprise portions of proteins 
derived from two different species, joined together chemically by conventional 
techniques or prepared as a single contiguous protein using genetic engineering 
techniques (e.g., DNA encoding the protein portions of the chimeric antibody can be 
expressed to produce a contiguous protein chain. See, e.g., Cabilly et al., U.S. Patent 
15 No. 4,816,567; Cabilly et al, European Patent No. 0,125,023 Bl; Boss et al, U.S. 

Patent No. 4,816,397; Boss et al, European Patent No. 0,120,694 Bl; Neuberger, M.S. 
etaU WO 86/01533; Neuberger, M.S. et al, European Patent No. 0,194,276 Bl; 
Winter, U.S. Patent No. 5,225,539; Winter, European Patent No. 0,239,400 Bl; Queen 
et al, U.S. Patent No. 5,585,089; and Queen et al., European Patent No. EP 0 451 216 
2 0 BL See also, Newman, R. et al, BioTechnolog)', i 0:1455-1460 (1992), regarding 

primatized antibody, and Ladner et al, U.S. Patent No. 4,946,778 and Bird, R.E. et al } 
Science, 242:423-426 (1988) regarding single chain antibodies.) 

Whole antibodies and biologically functional fragments thereof are also 
encompassed by the tenn antibody. Biologically functional antibody fragments which 
2 5 can be used include those fragments sufficient for binding of the antibody fragment to a 
FATP to occur, such as Fv, Fab, Fab' and F(ab') 2 fragments. Such fragments can be 
produced by enzymatic cleavage or by recombinant techniques. For instance, papain or 
pepsin cleavage can generate Fab or F(ab') 2 fragments, respectively. Antibodies can 
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also be produced in a variety of truncated forms using antibody genes in which one or 
more stop codons have been introduced upstream of the natural stop site. For example, 
a chimeric gene encoding a F(ab')2 heavy chain portion can be designed to include DNA 
sequences encoding the CH t domain and hinge region of the heavy chain. 
5 Preparation of immunizing antigen (whole cells comprising FATP on the cell 

surface or purified FATP), and polyclonal and monoclonal antibody production can be 
performed using any suitable technique. A variety of methods have been described (See 
e.g.,-Kohler et al, Nature, 256: 495-497 (1975) and Eur. J. Immunol 6: 511-519 
(1976); Milstein et al, Nature 266: 550-552 (1977); Koprowski et al, U.S. Patent No. 

10 4,172,124; Harlow, B. and D. Lane, 1988, Antibodies: A Laboratory Manual, (Cold 

Spring Harbor Laboratory: Cold Spring Harbor, NY); Chapter 1 1 In Current Protocols 
In Molecular Biology, Vol. 2 (containing supplements up through Supplement 42, 
1998), Ausubel, F.M. et al, eds., (John Wiley & Sons: New York, NY)). Generally, a 
hybridoma can be produced by fusing a suitable immortal cell line (e.g., a myeloma cell 

15 line such as SP2/0) with antibody producing cells. The antibody producing cells, 

preferably those obtained from the spleen or lymph nodes, can be obtained from animals 
immunized with the antigen of interest. Immunization of animals can be by 
introduction of whole cells comprising fatty acid transport protein on the cell surface. 
The fused cells (hybridomas) can be isolated using selective culture conditions, and 

2 0 cloned by limiting dilution. Cells which produce antibodies with the desired specificity 
can be selected by a suitable assay (e.g., ELISA). 

Other suitable methods of producing or isolating antibodies (including human 
antibodies) of the requisite specificity can used, including, for example, methods which 
select recombinant antibody from a library (e.g., Hoogenboom et al, WO 93/06213; 

25 Hoogenboom et al, U.S. Patent No. 5,565,332; WO 94/13804, published June 23, 
1994; and Dower, WJ. et al, U.S. Patent No. 5,427,908), or which rely upon 
immunization of transgenic animals (e.g., mice) capable of producing a full repertoire of 
human antibodies (see e.g., Jalcobovits et al, Proc. Natl Acad. Set USA, 90: 2551-2555 
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(1993); Jalcobovits et aL, Nature, 562:255-258 (1993); Lonberg et al 9 U.S. PatentNo. 
5,569,825; Lonberg et ah, U.S. Patent No. 5,545,806; Surani et aL, U.S. Patent No. 
5,545,807; and Kucherlapati, R. et aL, European Patent No. EP 0 463 151 Bl). 

Another aspect of the invention is a method for directing an agent to cardiac 
5 muscle. The differential expression of FATP6 in cardiac muscle but not in other tissue 
types allows for the specific targeting of drugs, diagnostic agents, tagging labels, 
histological stains or other substances specifically to cardiac muscle. A targeting 
vehicle can be used for the delivery of such a substance. Targeting vehicles which bind 
specifically to FATP6 can be linked to a substance to be delivered to the cells of cardiac 
1 0 muscle. Hie linkage can be, for instance, via one or more covalent bonds, or by high 
affinity non-covalent bonds. A targeting vehicle can be an antibody, for instance, or 
other compound (e.g., a fatty acid or fatty acid analog) which binds to FATP6 with high 
specificity. 

Targeting vehicles specific to the heart-specific protein FATP6 have in vivo 
15 (e.g., therapeutic and diagnostic) applications. For example, an antibody which 

specifically binds to FATP6 can be conjugated to a drug to be targeted to the heart (e.g., 
a cardiac glycoside to treat congestive heart failure, or p-adrenergic agents, sodium 
channel blockers or calcium channel blockers to treat anhythmias). A substance (e.g., a 
radioactive substance) which can be detected (e.g., a label) in vivo can also be linked to 
2 0 a targeting vehicle which specifically binds to a heart-specific protein such as FATP6, 
and the conjugate can be used as a labeling agent to identify cardiac muscle cells. 

Targeting vehicles specific to FATP6 find further applications in vitro. For 
example, an FATP6-specific targeting vehicle, such as an antibody (a polyclonal 
preparation or monoclonal) which specifically binds to FATP6, can be linked to a 
2 5 , substance which can be used as a stain for a tissue sample (e.g., horseradish peroxidase) 
to provide a method for the identification of cardiac muscle in a sample, as can be used 
in embryology studies, for example. 
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Iii a similar manner, an agent can be directed to the liver of a mammal, as 
FATP5 is expressed in liver but not in other tissue types. A targeting vehicle which 
specifically binds to FATP5 can be conjugated to a drug for delivery of the drug to the 
liver, such as a drug to treat hepatitis, Wilson's disease, lipid storage diseases and liver 
5 cancer. As with targeting vehicles specific to FATP6, targeting vehicles .specific to 
FATP5 can be used in studying tissue samples in vitro. 

The invention also relates to compositions comprising a modulator of FATP 
function. The term "modulate" as used herein refers to the ability of a molecule to alter 
the function of another molecule. Thus, modulate could mean, for example, inhibit, 

10 antagonize, agonize, upregulate, downregulate, induce, or suppress. A modulator has 
the capability of altering function of its target Such alteration can be accomplished at 
any stage of the transcription, translation, expression or function of the protein, so that, 
for example, modulation of a target gene can be accomplished by modulation of the 
DNA or RNA encoding the protein, and the protein itself. 

1 5 Antagonists or agonists (inhibitors or enhancers) of the FATPs of the invention, 

antibodies that bind a FATP, or mimetics of a FATP can be employed in combination 
with a non-sterile or sterile earner or carriers for use with cells, tissues or organisms, 
such as a pharaiaceutical carrier suitable for administration to a mammalian subject. 
Such compositions comprise, for instance, a media additive or a therapeutically 

2 0 effective amount of an inhibitor or enhancer compound to be identified by an assay of 
the invention and a pharmaceutically acceptable carrier or excipient. Such carriers may 
include, but are not limited to, saline, buffered saline, dextrose, water, ethanol, 
surfactants, such as glycerol, excipients such as lactose arid combinations thereof. The 
formulation can be chosen by one of ordinary skill in the art to suit the mode of 

2 5 administration. The chosen route of administration will be influenced by the 

predominant tissue or organ location of the FATP whose function is to be inhibited or 
enhanced. For example, for affecting the function of FATP4, a preferred administration 
can be oral or through a tube inserted into the stomach (e.g., direct stomach tube or 
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nasopbaryngeal tube), or through other means to accomplish delivery to the small 
intestine. The invention further relates to diagnostic and pharmaceutical packs and kits 
comprising one or more containers filled with one or more of the ingredients of the 
aforementioned compositions of the invention. 
5 Compounds of the invention which are FATPs, FATP fusion proteins, FATP 

mimetics, FATP gene-specific antisense poly- or oligonucleotides, inhibitors or 
enhancers of a FATP may be employed alone or in conjunction with other compounds, 
such as therapeutic compounds. The pharmaceutical compositions may be administered 
in any effective, convenient manner, including administration by topical, oral, anal, 
1 0 vaginal, intravenous, intraperitoneal, intramuscular, subcutaneous, intranasal, 

transdermal or intradermal routes, among others. In therapy or as a prophylactic, the 
active agent may be administered to an individual as an injectable composition, for 
example as a sterile aqueous dispersion, preferably isotonic. 

Alternatively, the composition may be formulated for topical application, foi- 
ls example, in the form of ointments, creams, lotions, eye ointments, eye drops, ear drops, 
mouthwash, impregnated dressings and sutures and aerosols, and may contain 
appropriate conventional additives, including, for example, preservatives, solvents to 
assist drug penetration, and emollients in ointments and creams. Such topical 
formulations may also contain compatible conventional carriers, for example cream or 
2 0 ointment bases, and ethanol or oieyl alcohol for lotions. 

hi addition, the amount of the compound will vary depending on the size, age, 
body weight, general health, sex, and diet of the host, and the time of administration, the 
biological half-life of the compound, and the particular characteristics and symptoms of 
the disorder to be treated. Adjustment and manipulation of established dose ranges are 
2 5 well within the ability of those of skill in the art. 

A further aspect of the invention is a method to identify a polymorphism, or the 
presence of an alternative or variant allele of a gene in the genome of an organism (of 
interest here, genes encoding FATPs). As used herein, polymorphism refers to the 
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occurrence of two or more genetically determined alternative sequences or alleles in a 
population. A polymorphic locus may be as small as a base pair. Polymorphic markers 
include restriction fragment length polymorphisms, variable number of tandem repeats 
(VNTR's), hypervariable regions, minisatellites, dinucleotide repeats, trinucleotide 
5 repeats, tetranucleotide repeats, simple sequence repeats, and insertion elements such as 
Alu. The first identified aileleic form, or the most frequently occurring form can be 
arbitrarily designated as the reference (usually, "wildtype") form, and other allelic forms 
are designated as alternative (sometimes, "mutant" or "variant' 1 ). Dipolid organisms 
may be homozygous or heterozygous for allelic forms. 

10 An "allele" or "allelic sequence" is an alternative form of a gene which may 

result from at least one mutation in the nucleotide sequence. Alleles may result in 
altered mRNAs or polypeptides whose structure or function may or may not be altered. 
Any given gene may have none, one, or many allelic forms (polymorphism). Common 
mutational changes which give rise to alleles are generally ascribed to natural deletions, 

15 additions, or substitutions of nucleotides. Each of these types of changes may occur 
alone, or in combination with the others, one or more times in a given sequence. 

Several different types of polymorphisms have been reported. A restriction 
fragment length polymorphism (RFLP) is a variation in DNA sequence that alters the 
length of a restriction fragment (Botstein et aL, Am. J. Hum. Genet. 32:314-331 (1980)). 

2 0 The restriction fragment length polymorphism may create or delete a restriction site, 
thus changing the length of the restriction fragment. RFLPs have been widely used in 
human and animal genetic analyses (see WO 90/13668; WO 90/11369; Donis-Keller, 
Cell 57:319-337 (1987); Lander et aL, Genetics 727:85-99 (1989)). When a heritable 
trait can be linked to a particular RFLP, the presence of the RFLP in an individual can 

2 5 be used to predict the likelihood that the individual will also exhibit the trait. 

Other polymorphisms take the form of short tandem repeats (STRs) that include 
tandem di-, tri- and tetra-nucleotide repeated motifs. These tandem repeats are also 
referred to as variable number tandem repeat (VNTR) polymorphisms. VNTRs have 
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been used in identity and paternity analysis (US 5,075,217; Armour et al, FEBS Lett- 
307:113-1 15 (1992); Horn et al, WO 91/14003; Jeffreys, EP 370,719), and in a large 
number of genetic mapping studies. 

Other polymorphisms take the form of single nucleotide variations between 
5 individuals of the same species. Such polymorphisms are far more frequent than 
RPLPs, STRs (short tandem repeats) and VNTRs (variable number tandem repeats). 
Some single nucleotide polymorphisms occur in protein-coding sequences, in which 
case, one of the polymorphic forms may give rise to the expression of a defective or 
other variant protein and, potentially, a genetic disease. Other single nucleotide 

1 0 polymorphisms occur in noncoding regions. Some of these polymorphisms may also 
result in defective protein expression (e.g, 5 as^a result of defective splicing). Other 
single nucleotide polymorphisms have no phenotypic effects. 

Many of the methods described below require amplification of DNA from target 
samples and purification of the amplified products. This can be accomplished by PCR, 

15 for instance. See generally, PCR Technology, Principles and Applications for DNA 
Amplification (ed. H.A. Erlich), Freeman Press, New York, NY, 1992; PCR Protocols: 
A Guide to Methods and Applications (eds. Innis, et al.), Academic Press, San Diego, 
CA, 1990; Mattila et al, Nucleic Acids Res, 19:4967 (1991); Eckert et aL, PCR 
Methods and Applications 1:11 (1991); PCR (eds. McPherson et al y IRS Press, 

2 0 Oxford); and US 4,683,202. 

Other suitable amplification methods include the ligase chain reaction (LCR) 
(see Wu and Wallace, Genomics 4:560 (1989); Landegren et al, Science 241:1017 

(1988) ), transcription amplification (Kwoh et al, Proc. Natl Acad. Set USA 86:1173 

(1989) , self-sustained sequence replication (Guatelli et al, Proc. Natl. Acad. Sci. USA 
2 5 87: 1874 (1990), and nucleic acid based sequence amplification (NASBA). The latter 

two amplification methods involve isothermal reactions based on isothermal 
transcription, which produce both single stranded RNA (ssRNA) and double stranded 
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DNA (dsDNA) as the amplification products in a ratio of about 30 or 1 00 to 1 , 
. respectively. 

Another aspect of the invention is a method for detecting a variant allele of a 
human FATP gene, comprising preparing amplified, purified FATP DNA from a 
5 reference human and amplified, purified, FATP DNA from a "test" human to be 
compared to the reference as having a variant allele, using the same or comparable 
amplification procedures, and determining whether the reference DNA and test DNA 
differ in DNA sequence in the FATP gene, whether in a coding or a noncoding region, 
wherein, if the test DNA differs in sequence from the reference DNA, the test DNA 

1 0 comprises a variant allele of a human FATP gene. The following is a discussion of 
some of the methods by which it can be determined whether the reference FATP DNA 
and test FATP DNA differ in sequence. 

Direct Sequencing. The direct analysis of the sequence of variant alleles of the 
present invention can be accomplished using either the dideoxy chain termination 

1 5 method or the Maxam and Gilbert method (see Sambroolc at al, Molecular Cloning: A 
Laboratory Manual, 2nd ed., Cold Spring Harbor Press, New York 1989; Zyskind et al, 
Recombinant DNA Laboratory Manual, Acad. Press, 1988)). 

Denaturing Gradient Gel Electrophoresis. Amplification products generated 
using the polymerase chain reaction can be analyzed by the use of denaturing gradient 

2 0 gel eletrophoresis. Different alleles can be identified based on the different sequence- 
dependent strand dissociation properties and electrophoretic migration of DNA in 
solution (chapter 7 in Erlich, ed. PCR Technology, Principles and Applications for DNA 
Amplification, W.H. Freeman and Co., New York, 1992). 

Single-strand Conformation Polymorphism Analysis. Alleles of target 

2 5 sequences can be differentiated using single-strand conformation polymorphism 

analysis, which identifies base differences by alteration in electrophoretic migration of 
single stranded PCR products, as described in Orita et al, Proc. Natl Acad. Sci. USA 
5(5:2766-2770 (1989). Amplified PCR products can be generated as described above, 
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and heated or otherwise denatured, to form single-stranded amplification products. 
Single-stranded nucleic acids may refold or form secondary structures which are 
partially dependent on the base sequence. The different electrophoretic mobilities of 
single-stranded amplification products can be related to base-sequence differences 
5 between alleles of target sequences. 

Detection of Binding by Protein That Binds to Mismatches. Amplified DNA 
comprising the FATP gene or portion of the gene of interest from genomic DNA, for 
example, of a normal individual is prepared, using primers designed on the basis of the 
DNA sequences provided herein. Amplified DNA is also prepared, in a similar manner, 

1 0 from genomic DNA of an individual to be tested for bearing a distinguishable allele. 
The primers used in PCR carry different labels, for example, primer 1 with biotin, and 
primer 2 with 32 P. Unused primers are separated form the PCR products, and the 
products are quantitated. The heteroduplexes are used in a mismatch detection assay 
using immobilized mismatch binding protein (MutS) bound to nitrocellulose. The 

1 5 presence of bio tin-labeled DNA wherein mismatched regions are bound to the 

nitrocellulose via MutS protein, is detected by visualizing the binding of strep tavidin to 
biotin. See WO 95/12689. MutS protein has also been used in the detection of point 
mutations in a gel-mobility-shift assay (Lishanski, A. at al t Proc. Natl. Acad, ScL USA 
Pi:2674-2678 (1994)). 

2 0 Other methods, such as those described below, can be used to distinguish a 

FATP. allele from a reference allele, once a particular allele has been characterized as to 
DNA sequence. 

Allele-specific probes. The design and use of allele-specific probes for 
analyzing polymorph™ s is described by e.g., Saiki et ai, Nature 324:163-166 (1986); 
2 5 Dattagupta, EP 235,726, Saiki, WO 89/1 1548. Allele-specific probes can be designed 
so that they hybridize to a segment of a target DNA from one individual but do not 
hybridize to the corresponding segment from another individual due to the presence of 
different polymorphic forms in the respective segments from the two individuals. 



WO 99/36537 



PCT/US99/00182 



-65- 

Hybridization conditions should be sufficiently stringent that there is a significant 
difference in hybridization intensity between alleles, and preferably an essentially 
binary response, whereby a probe hybridizes to only one of the alleles. Some probes are 
designed to hybridize to a segment of target DNA such that the polymorphic site aligns 
5 with a central position (e.g., in a 15-raer at the 7 position; in a 16-mer, at either the 8 or 
9 position) of the probe. This design of probe achieves good discrimination in 
hybridization between different allelic forms. 

Allele-specific probes are often used in pairs, one member of a pair showing a 
perfect match to a reference form of a target sequence and the other member showing a 
10 perfect match to a variant fonn. Several pairs of probes can then be immobilized on the 
same support for simultaneous analysis of multiple polymorphisms within the same 
target sequence. 

Allele-specific Primers. An allele-specific primer hybridizes to a site on target 
DNA overlapping a polymorphism, and only primes amplification of an allelic fonn to 

1 5 which the primer exhibits perfect complementarity. See Gibbs, Nucleic Acid Res. 

1 7:2427-2448 (1989). This primer is used in conjunction with a second primer which 
hybridizes at a distal site. Amplification proceeds from the two primers, resulting in a 
detectable product which indicates the particular allelic form is present. A control is 
usually performed with a second pair of primers, one of which shows a single base 

2 0 mismatch at the polymorphic site and the other of which exhibits perfect 

complementarity to a distal site. The single-base mismatch prevents amplification and 
no detectable product is formed. The method works best when the mismatch is included 
in the 3'-most position of the oligonucleotide aligned with- the polymorphism because 
this position is most destabilizing to elongation from the primer (see, e.g., WO 

2 5 93/22456)! 

Gene Chips. Allelic valiants can also be identified by hybridization to nucleic 
acids immobilized on solid supports (gene chips), as described, for example, in WO 
95/11995 and U.S. Patent No. 5,143,854, both of which are incorporated herein by 
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reference. WO 95/1 1995 describes subarxays that are optimized for detection of a 
characterized variant allele. Such a subarray contains probes designed to be 
complementary to a second reference sequence, which is an allelic variant of the first 
reference sequence. 

5 The present method is illustrated by the following examples, which are not 

intended to be limiting in any way. 

EXAMPLES 
Materials and Methods 

The following Materials and Methods were used in die work described in 

10 Examples 1-5. 

Sequence Alignment of FATP Clones. The DNA sequence for mouse FATP 1 
was obtained from the National Center for Biotechnology Information nonredundant 
database. cDNAs for mmFATP2, 3, 4, and 5 were obtained by screening mouse 
expression libraries (purchased from GIBCO/BRL) with probes derived from the cloned 

15 expressed sequence tags (ESTs) (Research Genetics, Huntsville, AL), Full-length 

clones were obtained for mmFATP2 and 5 and partial sequences for mmFATP3 and 4. 
The sequences described herein have been deposited in the GenBank database 
(Accession Nos. FATP2, AF072760; FATP3, AF072759; FATP4, AF072758; FATP5, 
AF072757). 

2 0 Neither FATP2 nor FATP5 contains an in-frame stop codon upstream of the 

putative initiator methionine; initiator methionines were assigned by homology with 
that in mmFATPl and by the presence of a signal sequence immediately after it. The 
Mycobacterium tuberculosis, Caenorhabditis elegans, and Saccharomyces cerevisiae 
sequences were present in the dbEST database as part of the sequencing projects for 

2 5 these organisms. Sequences were aligned utilizing a ClustalX algorithm and the 
resulting alignment exported to SeqVu. Homologous amino acid substitutions are 
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boxed in Figure 1 and were determined using the DayhofF 250 method with a 50% 
homology cutoff. 

Cell Transfection and LCFA Uptake. COS cells were cotransfected using the 
DEAE-dextran method with the mammalian expression vector pCDNA 3.1 (Invitrogen) 
5 expressing the gene for CD2 (pCDNA-CD2) in combination with either a pCDNA 3.1 
or pCMVSPOKT2 (GIBCO/BRL) expression vector containing one of the murine or 
nematode FATP genes (p CDNA -mmFA TP1 , p CDNA -FA TP 2, pCMVSPORT-FATP5 y 
pCDNA-ceFATPb). Two days after transfection, cells were assayed for CD2 expression 
with a phycoerythrin-coupled anti-CD2(PE-CD2) monoclonal antibody (PharMingen), 

1 0 and fatty acid uptake was assayed with a BODIPY-labeled fatty acid analogue 

(Molecular Probes). Briefly, cells were washed twice with PBS. (phosphate buffered 
saline) and stained with PE-CD2 at 4°C for 30 min in PBS containing 10% fetal calf 
serum. They were then washed three times with PBS/fetal calf serum for 5 min 
followed by an incubation for 2 min at 37°C in fatty acid uptake solution, which 

1 5 contained 0.1 \iM BODIPY-FA and 0.1% fatty acid-free BSA (bovine serum albumin) 
in PBS (Schaffer, J.E. & Lodish, H.F. (1994) Cell 79:427-436). After 2 min, the cells 
were washed four times with ice-cold PBS/0.1% BSA. The cells were then removed 
from the plates with PBS containing 5 mM EDTA and resuspended in PBS containing 
10% fetal calf serum and 10 mM EDTA. PE-CD2 and BODIPY-FA fluorescence were 

2 0 measured using a FACScan (Becton Dickinson). COS cells were gated on forward 
scatter (FSC) and side scatter (SS). Cells exhibiting more than 300 CD2 fluorescence 
units (dsim) representing 15% of all cells were deemed CD2 positive and their 
BODIPY-FA fluorescence was quantitated. 

E. coli-Based LCFA Uptake Assay. The full-length coding region of mtFATP 

2 5 and a control protein, the mammalian transcription factor TFE3, were subcloned into the 
inducible, prolcaryotic expression vector pET (Novagen). Expression was induced with 
1 mM isopropyl p-D-thiogalactoside (IPTG) for 1 hour, or cells were left uninduced. 
Cells were washed in PBS/0.1% BSA and resuspended in 1 ml PBS/0.1% BSA 



WO 99/36537 



PCT/US99/00182 



-68- 

containing 0.1 |iM [ 3 H]palmitate (NEN) at 37° C. Uptake was stopped after the 
indicated incubation time by transferring the cells onto filter paper using a cell harvester 
(Braadel, Bethesda, MD). Filters were washed extensively with ice-cold PBS/0. 1% 
B S A, and [ 3 H]palmitate was quantitated by scintillation counting. 
5 Northern Blots. Northern blot analysis of murine F ATP expression was done 

using poly(A) niKNAblots (Clontech). Probes of each of the FATPs were derived from 
the 3 f untranslated regions of each gene and were <60% identical in sequence. Probes 
were labeled by random priming (Boehringer Mannheim) and hybridized at 65 °C. 
Blots were extensively washed in 0.2% SSC/0.1% SDS at 65 °C: 

1 o Generation of Phylogenetic Trees. Complete and partial sequences for FA TP 

genes from human, rat, mouse, puffer fish, Drosophila melanogaster, C. elegans, S. 
cerevisiae, and M. tuberculosis were aligned using ClustalX. A homologous region of 
48 amino acids (residues 472-519 inmmFATPl) from all of the genes was used to 
determine phylogenetic relationship within ClustalX. Based on these data a 
1 5 phylogenetic tree was generated using Tree View PPC (Figure 5). 

Nomenclature. It is proposed that the FATP genes be given a species specific 
prefix (mm, Mus musculus; hs, Homo sapiens; mt, M. tuberculosis; dm, D. 
melanogaster; ce, C. elegans, sc, S. cerevisiae) and numbered such that mammalian 
homologues in different species share the same number but differ in their prefix. Since 

2 0 the two C elegans genes cannot be paired with a specific human or mouse FATP, they 

have been designated ceFATPa and ceFATPb. 

Example 1: Identification of Novel Mammalian FATPs 

The National Center for Biotechnology Information EST database was screened, 
using the mouse FATP protein sequence (mmFATPl), to identify novel FATPs. This 
.25 strategy led to the identification of more than 50 murine EST sequences which could be 
assembled into five distinct contiguous DNA sequences (contigs). One contig was 
identical to the previously cloned FATP, which has been renamed FATP1 . Another, 
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which has been renamed FATP2, is the murine homologue of a rat gene previously . 
identified by others as a very long chain acyl-CoA synthase (Uchiyama, A., Aoyama, 
T., Kamijo, K., Uchida, Y., Kondo, N., Orii, T. & Hashimoto, T. (1996) /. Biol Chem. 
277:30360-30365). The other three contigs represented novel genes (FATP3, 4, and 5). 
5 Full-length clones for FATP2 and FA TPS and nearly complete sequences for FA TP 3 
and 4 (Figure 1) were obtained by screening cDNA libraries made from mouse day 
10.5 embryos and adult liver. Also identified were human homologues for each of the 
murine genes in the EST database. A sixth human gene was also identified; whether 
this gene is also present in the mouse will require additional studies. Map positions are 
1 0 given in Tables 2 and 3 . 

The genetic loci for all of the human genes, with the exception of FATP5 which 
was already mapped as an unlcnown EST, were determined using the radiation hybrid 
panels. The map positions given below show the distance (in centiRays) from the 
closest framework marker. As a guideline, there are approximately 300kb/cR. 
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Table 2. Mapping Data for Human Genes 



hsFATPl 
hsFATP2 

5 

hsFATP3 

hsFATP4 

1 0 hsEATPS 
hsFATP6 



Chromosome Chrl9 

places 13.35 cR from WI-6344 (lod>3.0) 
Chromosome Chrl5 
places 4.92 cR from D15S126 (lod>3.0) 
Chromosome Chrl 

places 13.24 cR from WI-2862 (lod>3.0) 
Chromosome Chr9 

places 7.80 cR from WI-9685 (lod>3.0) 

unknown EST previously mapped to near D19S418 

Chromosome Chr5 

places 1.41 cR from WI-4907 (lod>3.0) 



The mouse map is an internal baclccross panel consisting of 1 88 mouse 
baclccross DNA's plus 4 controls (B6> Spretus, Fl, Water). The baclccross was 

1 5 constructed by crossing B6 by Spretus animals and then crossing those Fl's back to B6. 
Mapping is accomplished by talcing advantage of recombinational events during 
meiosis, and the use of PGR primers to detect the differences (by size or re-annealing 
events) at any given locus between the B6 and Spretus allele. 

For the purposes of mapping, a novel set of primers (gene of interest) is used to 

2 0 amplify from all 188 DNA's and then typed as being a B6 ("B") or a Spretus ("S"). This 
string of B's and S ! s is entered into the Map Manager program, which does a best fit 
calculation by comparing the string of 188 typings from the gene of interest to all loci 
already extant in the panel, for all 20 chromosomes. The gene of interest is then 
assigned to a particular area on a particular chromosome according to a number of 

2 5 parameters, including the minimalization of double cross-overs, and die highest LOD 
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scores. Indicated in Table 3 are distances to the closest markers on either side of the . 
FATP locus. 



Table 3. Mapping Data for Mouse Genes 



minFATPl Chromosome 8 
5 places 2.82 cM from D8Mitl32 (lod 43.4) and 1.81 cM from D8Mit74 

God 43.5) 
mmFATP2 Chromosome 2 

places 1 .29 cM from D2Mit258 (lod 47.9) and 1 .75 cM from D2NDS3 
(lod 44.9) 
10 mmFATP3 Chromosome 3 

places 2.54 cM from D3Mit22 (lod 29.5) and 19.62 cM from D3MU42 
(lod 13.6) 
mmFATP4 Chromosome 2 

places 13.78 cM fromD2Mitl (lod 22.9) and 3.85 cM from D2Mit65 
15 (lod 41.9) 

mmFATPS Chromosome 7 

places 7.28 cM proximal of D7Mit21 (lod 28.3) 



Example 2: Assessment of Function 

The ability of the newly identified mouse genes to function as fatty acid 

2 0 transporters was assessed using a fluorescence-activated cell sorting-based assay. COS 
cells were transiently cotransfected with expression vectors encoding the cell surface 
protein CD2 and either mmFATPl, mmFATP2 5 or mmFATPS, respectively. Two days 
after transfection, COS cells were stained with an antibody to CD2 and then incubated 
with a BODIPY-labeled fatty acid [BODIPY-FA, (Schaffer, J.E. & Lodish, H.F. (1994) 

2 5 Cell 7P;427-436)]. The cells were then washed extensively, lifted off the dish, and 
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analyzed by fluorescence-activated cell sorting. As judged by the number of CD2- 
positive cells, the transfection efficiency was approximately 20-30%. Fatty acid uptake 
was quantitated in the transiently transfected COS cells by measuring the BODIPY-FA 
fluorescence of the CD2-positive cells. Expression of CD2 had no effect on fatty acid 
5 uptake as shown by the finding that COS cells expressing only the transfected CD2 
cDNA (CD2-positive) had the same low level of BODIPY-FA uptake as did 
untransfected (CD2-negative) control cells (Figure 2A, control). In COS cells 
cotransfected with CD2 and mmFATPl, mmFATP2, or mmFATP5, uptake of 
BODIPY-FA by the transfected (CD2-positive) cells was increased between 15- to 90- 
10 fold over control (CD2 cDNA only) cells (Figures 2A-2D). 

Example 3 : Expression Patterns of Murine FATPs 

Expression patterns of members of the murine FATP gene family were 
characterized by Northern blot analysis; to avoid cross-hybridization, the probes used 
were from the 3* untranslated region of these genes, which are less than 60% identical in 

1 5 sequence. The expression pattern of FATP 1 agrees with that previously found 

(Schaffer, J.E. & Lodish, H.F. (1994) Cell 79:427-436). Here, expression was seen 
primarily in heart and kidney. FATP2 is expressed almost exclusively in liver and 
kidney, which corresponds to the reported tissue distribution of the rat homologue [very 
long chain acyl-CoA (VLACS)] as assessed by Western blotting (Uchiyama, A., 

20 Aoyama, T., Kamijo, K., Uchida, Y., Kondo, N., Orii, T. & Hashimoto, T. (1996) J. 
Biol Chem. 272:30360-30365). FATP3 is present in lung, liver, and testis. FATPS is 
expressed only in liver and cannot be detected in other tissues even when the blot is 
overexposed. The human homologue of FATP 5 is also liver specific and is not 
expressed in a wide array of other tissues tested, including fetal liver. 
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Example 4: FATPs Are Evolutionary Conserved 

The EST database was searched, using sequences conserved among the five 
murine FATP genes, for FATP genes in other organisms. Two homologues were found 
in C. elegans and one in M tuberculosis. One of the C. elegans genes was cloned from 
5 a cDNA library and expressed in COS cells, as described for the murine FATPs. 

Overexpression of the nematode FATP resulted in a 15-fold increase of BODIPY-FA 
uptake compared with control cells (Figure 3). The mycobacterial FATP gene was 
isolated from a phage library and assessed for its ability to facilitate fatty acid uptake. 
K coli transformed with a prolcaryotic, isopropyl P-D-thiogalactoside»ind.ucible 
1 0 expression vector containing the mycobacterial FATP gene demonstrated a significant 
increase in the rate of [ 3 H]palmitate uptake after induction, compared with umnduced 
bacteria or E. coli transformed with a control protein (Figure 4). Novel FATP genes 
were also identified in J 7 , rubripes (puffer fish) andD. melanogaster. 

Examples: Phylogenetic Tree of FATPs 

15 Faergeman et al (Faergeman, N.J., DiRusso. C.C., Elberger, A., Knudsen, J. & 

Black, P. N. (1997) J. Biol Chem. 272:8531-8538) identified three regions of very 
strong conservation between the scFATP and mmFATPl genes. The sequences of the 
FATPS were compared over a 31 1 -amino acid FATP "signature sequence 1 ' which 
includes these conserved regions corresponding to amino acids 246-557 in mmFATPl 

2 0 (underlined in Figure 1). When compared with the National Center for Biotechnology 
Information nonredundant database, only one region of the "FATP signature sequence" 
shows significant homology to other proteins. This small stretch of amino acids 
(underlined in Fig. 1) is an AMP-binding motif found in a multitude of other proteins,- 
such as acyl-CoA synthase, several CoA lipases, and gramicidin S synthetase 

2 5 component II (Schaffer, J.E. & Lodish, H.F. (1994) Cell 79:427-436). The relevance of 
this motif to fatty acid transport is unclear. Other highly conserved regions among the 
FATPs, including long stretches of amino acids >90% identical from mycobacteria to 
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humans, are not found in any other class of proteins. A 48-amino acid segment of the. 
FATP signature sequence was used to construct a phylogenetic tree (Figure 5). Each of 
the human and mouse genes form, their own branch; hsFATP6, which as yet has no 
murine homologue, is most closely related to hsFATP3 and mmFATPS. As expected, 
5 niVLACS is closer in sequence to mmFATP2 than to hsFATP2. The FATP genes of 
invertebrates i.e., C. elegans and D. melanogastei% are most closely related to each 
other. Surprisingly, the mycobacteral gene is more closely related to the human and 
mouse FATP 5 genes than to the FATPs of any of the lo.wer organisms. Whether this 
reflects coevolution of the mycobacterial and human genes awaits further study. 

10 Materials and Methods ' 

The following materials and methods were used in the work describedun 
Examples 6-10. 

Isolation of full-length human FATP1 and 4 

Fixll-length clones encoding human FATP1 and humau FATP4 were identified 
15 by searching databases for sequences similar to murine FATP 1-5 coding regions using 
the BlastX algorithm (Altschul et al, J. MoL Biol 21 5\ 403-410, 1990). 

A concatamer of nucleotide sequences comprising the coding sequences of 
mmFATPl (Genbank Accession U15976), mmFATP2, mmFATP3 (SEQ ID NO;6), 
mmFATP4 (SEQ ID NO; 8) and mmFATPS (SEQ ID NO: 10) was used to search the 
2 0 Millennium database using the BLASTX algorithm. Sequences with a score >1 50 were 
evaluated for whether they represented known FATP coding sequences. 

Human clones with similarity to die 5 s end of murine FATP sequences were 
sequenced completely. Clones encoding full-length human FATP1 were obtained from 
a heart cDNA library constructed in the mammalian expression vector pMET7 
25 (Tartagliaef a/., Cell, 83: 1263-1271, 1995). Clones encoding full-length human 
FATP4 were obtained from a spleen cDNA library constructed in the mammalian 
. expression vector pMET7. 
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Isolation of full-length human FATP6 

Several clones encoding human FATP6 were identified by searching public 
databases as described above. Five clones were analyzed further by restriction digestion 
and DNA sequencing. One of these clones (Genbank Accession # AA4 12064) appeared 
5 to be full-length and its entire insert was sequenced. 

DNA Sequence Analysis 

Sequences were aligned with the DNAStar program using the Clustal method. 
Hydrophobicity plots were generated with DNA Strider using the Kyte Doolittle 
method. 

10 In situ hybridization 

Tissues were collected from 8 week old C57/B16 mice. Tissues were fresh, 
frozen, cut on a cryostat at 10 \im thickness and mounted on Superfrost Plus slides 
(VWR). Sections were air dried for 20 minutes and then incubated with ice cold 4% 
paraformaldehyde (PFA)/phosphate buffered saline (PBS) for 10" minutes. Slides were 

1 5 washed 2 times 5 minutes with PBS, incubated with 0.25% acetic anhydride/1 M 

triethanolamine for 10 minutes, washed with PBS for 5 minutes and dehydrated with 
70%, 80%, 95% and 100% ethanol for 1 minute each. Sections were incubated with 
chloroform for 5 minutes. Hybridizations were performed with 35 S-radiolabeled (5xl0 7 
cpm/ml) cRNA probes generated from the 3' untranslated regions of mouse FATPs by 

2 0 PCR followed by in vitro transcription in the presence of 50% fomiamide, 10% dextran 
sulfate, Ix Denbardt's solution, 600 mM NaCl, 10 mMDTT, 0.25% SDS and 10 ^ig/ml 
tRNA for 18 hours at 55 °C. After hybridization, slides were washed with 10 mM Tris- 
HC1 pH 7.6, 500 mM NaCl, 1 mM EDTA (TNE) for 10 minutes, incubated in 40 
jig/ml RNase A in TNE at 37°C for 30 minutes, washed in TNE for 10 minutes, 

2 5 incubated once in 2x SSC at 60°C for 1 hour, once in 0.2x SSC at 60°C for 1 hour, 
once in 0.2x SSC at 65 °C for 1 horn- and dehydrated with 50%, 70%, 80%, 90% and 



WO 99/36537 



PCT/US99/00182 



-76- 

100% ethanol. Localization of mRNA transcripts was detected by dipping slides in . 
Kodak NBT-2 photoemulsion and exposing for 7 days at 4°C S followed by development 
with Kodak Dektol developer. Slides were counter stained with haematoxylon and 
eosin and photographed. Controls for the in situ hybridization experiments include the 
5 use of a sense probe which showed no signal above background in all cases. 

Northern Blotting 

Human niRNA blots were obtained from Invitrogen or Clontech. PCR fragments 
from the 3' untranslated regions of human FATPs were used as probes. Blots were 
probed with 32 P-labeled DNA probes using the Rapid-Hyb buffer (Amersham) 

1 0 according to the manufacturer's instructions. 

Cell transfection and LCFA uptake. COS cells were cotransfected, using 
lipofectamine (GIBCO BRL) according to the manufacturer's instructions, with the 
mammalian expression vector pCDNA3.1 (Invitrogen) expressing the gene for CD2 in 
combination with a pMET7 expression vector (Tartaglia et aL t Cell, 83: 1263-1271, 

1 5 1995) containing hsFATPl (pMET7-hsF ATP 1 ) or hsFATP4 (pMET7-hsFATP4) or 

pMET7 alone. Two days after transfection, cells were assayed for CD2 expression with 
a phycoerythrin-coupled anti-CD2 (PE-CD2)' monoclonal antibody (PharMingen), and 
fatty acid uptake was assayed with a BODIPY-labeled fatty acid analog (Molecular 
Probes) as described above. 

2 0 Example 6: Detemiination of Expression of mmFATPs 

mmFATP4, and to lesser extent mmFATP2 5 are expressed at high levels in the 
brush border layer of the small intestine. 

Cell transfection and LCFA uptake. COS cells were cotransfected, using 
lipofectamine (GIBCO BRL) according to the manufacturer's instructions, with the 
2 5 mammalian expression vector pCDNA3.1 (Invitrogen) expressing the gene for CD2 in 
combination with a pMET7 expression vector (Tartaglia et ah, Cell, 83:1263-1271, 
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1995) containing hsFATP 1 (pMET7-hsF ATP 1 ) or hsFATP4 (pMET7-hsFATP4) or • 
pMET7 alone. Two days after transfection, cells were assayed for CD2 expression with 
a phycoerythrin-coupled anti-CD2 (PE-CD2) monoclonal antibody (PharMingen), and 
fatty acid uptake was assayed with a BODIPY-labeled fatty acid analog (Molecular 
5 Probes) as described above. 

Absorption of dietary fat requires transport of free fatty acids across the apical 
membrane of epithelial cells in the small intestine. Previous studies suggested that this 
transport is protein-mediated; however, the transport protein had not yet been identified. 
In situ hybridization was performed on each of the three regions of the small intestine — 

1 0 duodenum, jejunum and ileum - as well as the colon, using probes from the 3 5 
untranslated regions of mnxFATPl, mmFATP2 s mmFATP3, mmFATP4 and 
mmFATP5 5 to determine whether any of the mouse FATPs are expressed in the small 
intestine. It was expected that a protein involved in fatty acid absorption would be 
expressed in the epithelial cells of the small intestine, but absent from the colon. 

1 5 Expression of mmFATPs in the jejunum was identical to that in the ileum in all 

cases. High levels of mmFATP4 mRNA were present in the epithelial cells of the 
jejunum and ileum, and lower, but significant, amounts were detected in the epithelial 
cells of the duodenum. Significantly, FATP4 mRNA was absent from other cell types 
of the small intestine and no FATP4 mRNA could be detected in any of the cells of the 

2 0 colon. FATP2 mRNA was present in the epithelial cells of the duodenum at a level 

similar to that of FATP4, but was present at lower levels in the jejunum and ileum. No 
signals above background were detected for mmFATPl, mmFATP3 and iumFATP5 in 
any of the intestinal tissues. mmFATP3 and FATPS were clearly detectable by in situ 
hybridization in adult liver and mmFATPl could be detected in a variety of tissues on a 

2 5 whole embryo in situ, indicating that the FATP1, 3, and 5 probes were working. 

mmFATP4 expression is predominant in the small intestine compared to the 
other organs of the mouse embryo. In the small intestine, FATP4 expression is limited 
to differentiated enterocytes, while no signal is detected in the connective tissue or the 
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undifferentiated epithelial cells in the crypts. Differentiated enterocytes are known to be 
tlie cells that mediate the uptake of fatty acids. FATP4 is specifically and strongly 
expressed in the epithelial cells of adult murine duodenum and ileum but not colon. 
Other FATPs, such as FATP5, are not expressed in the small intestine. Thus, FATP4 is 
5 the major FATP in the mouse small intestine. Given its high level of expression, it is 
likely that FATP4, and to a 'lesser extent FATP2, play an important role in the 
absorption of fatty acids. 

nraiFATP2, and mmFATP5 are expressed in hepatocytes 

Northern analysis of mmFATP2, mmFATP3, mmFATP4 and mmFATP5 

1 0 showed expression in the liver. To determine whether these proteins are present in 
hepatocytes or other cells types present in liver homogenates, in situ hybridizations 
were performed. mmFATP2 5 and mmFATPS mRNA was clearly present in 
hepatocytes, and was not concentrated in other cell types such as endothelial cells or 
macrophages. No signal above background was detected for mmFATPl in any of tire 

15 cell types in the liver, consistent with the results of the Northern blotting. 

Example 7: Isolation and Sequence Analysis of Full-length Human FATP1 and Full- 
length Human FATP4 

To identify human cDNA clones encoding FATP family members, Millennium 
databases were searched for sequences similar to murine FATP 1-5 coding regions. Two 

2 0 clones were analyzed in detail; inspection of the entire DNA sequence of these two 
clones showed that they encode the human orthologs of mmFATPl and mm FATP4, 
respectively. These two clones were designated hsFATPl and h$FATP4, and their 
DNA and predicted protein sequences are shown in Figures 44A-44C and 45, and 50A- 
50C and 51. hsFATPl is predicted to encode a 646 amino acid, 71 IcD protein with 

25 multiple membrane-spanning domains (Figure 28A). HsFATP4 is predicted to encode a 
643 amino acid, 72 kD protein with multiple membrane spanning domains (See Figure 
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29 A). A comparison of the DNA sequences of mouse and human FATP1 and mouse, 
and human FATP4 (Figures 30A-30B and 31A-31B) shows that the mouse and human 
orthologs are 85% (FATP1) and 87% (FATP4) identical to each other within the coding 
sequences given in these figures. At the amino acid level, hsFATPl and hsFATP4 are 
5 -90% identical to their respective mouse orthologs within the coding region shown in 
these figures (Figures 32 and 33). The sequence identities between mouse and human 
FATP1 and FATP4 are considerably higher than the ones observed between different 
FATP family members within one species (~40%-60%) and are present in the N- 
terminal part of the protein, a region that is poorly conserved between different FATP 

1 0 family members. This high degree of sequence conservation clearly demonstrates that 
the newly identified human FATPs are orthologs of mouse FATP1 and FATP4 rather 
than novel FATP family members. 

Table 4 is an identity/similarity matrix comparing the amino acid sequences of 
FATP1 and 4 from human and mouse. This shows that the gene whose sequence is 

15 shown in Figure 43 A is indeed human FATP4, since it is 91% identical with the murine 
FATP4 but only 62% identical with the closest related human FATP, which is FATP1 . 



Table 4 


Identity/Similarity Matrix 




hsFATP4 


mmFATP4 


hsFATPl 


mmFATPl 


hsFATP4 




93.2 


72.3 


72.0 


mmFATP4 


91.0 




71.2 


71.1 


hsFATPl 


61.9 


61.0 




92.4 


mmFATPl 


60.7 


59.6 


89.5 





Example 8: Isolation and Sequence Analysis of Full-length Human FATP6 

A search of EST databases identified a set of overlapping human sequences that 
2 5 were similar to FATPs, but did not have a clear mouse ortholog. One of these EST 
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clones was found to encode a full-length cDNA. The entire insert of this clone was 
sequenced and designated hsFATP6. The DNA and predicted protein sequences of 
hsFATP6 are shown in Figures 54A-54C and 55. HsFATP6 is predicted to encode a 
619 amino acid, 70 kD protein with multiple membrane-spanning domains (Figure 
35A). - A comparison of the amino acid sequences of hsFATP6 with other human 
FATPs shows about 37% identity to either hsFATPl or hsFATP4 (Figure 36). This 
degree of sequence identity is similar to what is observed between different mouse 
FATPs. The phylogenetic analysis described above clearly demonstrates that hsFATP6 
is a member of the FATP family, but not an ortholog of any of the mouse FATPs. 
Comparisons were done with "ALIGN" (E. Myers and W. Miller, "Optimal Alignments 
in Linear Space," CABIOS 4:1 1-17 (1988) using standard settings. 

Example 9 : Tissue Distribution of Human FATPs 

The tissue distribution of human FATPs was assessed by Northern blotting. 
Human FATP3 was expressed in a large variety of tissues. In contrast, human FATP5 
15 was present at high levels in the liver, but was undetectable in all other tissues 

examined. Thus, both hsFATP3 and hsFATPS recapitulate the expression pattern of 
their mouse orthologs (see above). HsFATP6 is a novel FATP with no mouse ortholog 
as yet. Northern blotting shows that hsFATP6 is expressed at high levels in the heart, 
but is undetectable in other tissues, including skeletal and smooth muscle. This tissue 
2 0 distribution suggests that human FATP6 performs an important role in energy 

metabolism in die heart; blocking FATP6-mediated fatty acid transport may therefore 
be beneficial for a number of heart diseases, e.g., ischemic heart disease. 

To identify the major FATP expressed in the human small intestine, Northern 
blotting was performed on a blot containing mRNA from human stomach, jejunum, 
2 5 ileum, colon, rectum and lung. hsFATPS and hsFATP6 were undetectable in any of 
these tissues. FATPS is only expressed in liver and FATP 6 only in heart. hsFATP2 
was weakly expressed in the colon, and an even weaker signal was detectable in 
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jejunum, ileum and lung lanes. hsFATP3 was expressed well in the lung, but was only 
weakly expressed in the other tissues tested. Importantly, no difference was seen in the 
expression of hsFATP3 between small intestine and stomach or colon, suggesting that 
the expression observed is not related to fatty acid absorption in the small intestine. 
5 hsFATP4 was clearly expressed in both jejunum and ileum; expression was 

significantly lower in the colon and was absent in the stomach. This expression pattern 
is consistent with a major role for FATP4 in absorption of fatty acids in the human gut. 

Example 10; Expression of hsFATPl and hsFATP4 Promotes Transport of Fatty Acids 
COS cells were cotransfected using lipofectamine with the mammalian 

1 0 expression vector pCDNA-CD2 in combination with one of the FATP-containing 

expression vectors (pMET7-hsFATPl or pMET7-hsFATP4) or an insertless expression 
vector (pMET7, control) as described in Materials and Methods for Examples 6-10. 
COS cells were gated on forward scatter and side scatter. Cells exhibiting more than 
400 CD2 fluorescence units representing -30% of all cells were deemed CD2-positive. 

15 The percent of CD2-positive cells exhibiting a BODEPY-fluorescence of >300 is plotted 
for the three different vectors tested (Figure 37). 

Example 1 1: Stable Expression of Human FATP4 in 293 Cells 

Stable cell lines were 'generated as follows. A DNA fragment containing the 
entire hsFATP4 coding sequence as well as 100 nucleotides of 5' and 50 nucleotides of 

20 3' untranslated region was inserted into the vector pIRES-neo (Clontech) using standard 
cloning techniques. The resulting construct or a vector control (pIRES-neo) was 
transfected into 293 cells using the lipofectamine method (Gibco BRL) according to the 
manufacturer's directions. Cells that had taken up the DNA were selected with 1 mg/ml 
G418 (Gibco BRL). Single colonies were picked 1 to 2 weeks after transfection and 

2 5 grown in medium containing 0.8 mg/ml G418. Colonies were screened for the ability to 
take up fatty acids by measuring uptake of a fluorescently labeled fatty acid (BODIPY- 
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FA). About 40 colonies trarisfected with tlie pIRES-neo containing FATP4 and -20 . 
colonies transfected with pIRES-neo control were analyzed. All 20 of the vector control 
clones showed amounts of BODIFY-FA uptake similar to each other and to 
untransfected 293 cells. In contrast, among the 40 FATP4 transfected clones, 3 had a 5- 
5 to 10-fold increased BODIPY-FA uptake compared to any of the vector controls, and a 
large number (-20) showed an approximately two-fold increase in BODIPY-FA levels. 
This distribution is consistent with FATP4 confening increased fatty acid uptake in 
these cells. One of the cell lines with the highest amount of BODIPY-FA uptake was 
selected. to be used for measuring uptake of tritiated fatty acid. 
10 The uptake of tritiated oleate over time by either FATP4. expressing or control 

cells was assayed over time. Expression of FATP4 increases the rate of fatty acid 
uptake by over 3-fold, demonstrating that FATP4 is, like the other FATPs, a functional 
fatty acid transporter (Figure 38). 

Example 12: Immuno-staining with FATP4-Specific Antiserum 
15 A polyclonal antiserum against the C-terminus of mmFATP4 was raised using a 

GST-fusion protein having mmFATP4-specific amino acid sequence 552-643 
(AVASP...GEEKL). In western blot experiments, the purified antibody reacted strongly 
with a synthetic peptide matching the C-teiminus of mmFATP4, but not with a 
corresponding region of mmFATP2, mmFATP3, or mmFATP5. The mmFATP4 
2 0 specific polyclonal antiserum detects, in western blot experiments with enterocyte 
ly sates from 3 different mice, a -70 kDa protein, which is in accordance with 
imnFATP4's predicted molecular weight of 72 kDa. The binding is specific for 
nimFATP4, since it can be completely abolished by preincubation of the antiserum with 
the GST-fusion peptide used to raise the antibody. 
■ 25 Immunofluorescence experiments were performed using the anti-mmFATP4 

antiserum on fresh frozen sections of murine small intestine. The antibody binding 
demonstrates strong expression of mmFATP4 in enterocytes, confirming the results of 
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the in situ hybridization experiments. At higher magnifications it is apparent that 
mmFATP4 is expressed at the apical side of the enterocyte, indicating that the 
transporter is present in the brush border membrane, which is known to mediate the 
uptake of fatty acids from the intestinal lumen. 

Immuno-electron microscopy studies were performed on fresh frozen murine 
intestinal cells. The gold particles used, appearing as black specks on the electron 
micrographs, indicate the subcellular localization of mmFATP4 to be on the microvilli 
of the enterocyte. It can be seen from the electron micrographs that mmFATP4 is 
localized exclusively in membranes, preferentially the apical plasma membrane, 
confirming that it is indeed a membrane protein. 

Example 13: Inhibition of Fatty Acid Uptake Specific to FATP4 Demonstrated in. 
Isolated Mouse Enterocytes 

Phosphorothioate derivatives of the following oligonucleotides were 

synthesized: 

FATP4-AS2 CCCCCACCAGAGAGGCTCC (SEQ ID NO: 1 00) 

FATP4-AS2MM CCACCCCCGGAAAGCCTGC (SEQ ID NO.101) 

FATP4-S2 GGAGCCTCTCTGGTGGGGG (SEQ ID NO:102) 

FATP4 AS2 is the antisense oligo; it is designed to be complementary to the sequence 
extending from nucleotide 10 to nucleotide 28 of the mouse FATP4 coding sequence. 
20 FATP4-AS2MM is a control oligo; in the oligo every third nucleotide was changed 
creating mismatches; the overall nucleotide composition is identical to FATP4-AS2 
(same number of G, A, T, C). FATP4-S2 is the sense control. 

Enterocytes were isolated from the small intestine of mice and incubated for 48h 
in tissue culture (Figure 40) either without oligonucleotides (squares) or with 100 \iM 
25 FATP4 specific sense (circles) or antisense (diamonds) oligonucleotides. The uptake 
over time of 25 yM oleate was then measured. While the FATP4 sense oligonucleotide 
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did not significantly influence the uptake, the antisense oligonucleotide inhibited fatty 
acid uptake by - 50%. 

The effect of either FATP4 sense, antisense or mismatch sequence 
oligonucleotides on the uptake of fatty acids was measured in enterocytes. Isolated 
5 enterocytes were incubated with increasing concentrations of FATP4 antisense 

oligonucleotides (solid bars in Figure 41), or a mismatch control oligonucleotide with 
identical nucleotide composition (stippled bars), or with 100 p,M of the FATP4 sense- 
oligonucleotide (lined bar). The medium for this incubation was Dulbecco's modified 
Eagle's medium with 4.5 g/L glucose, 1 mM sodium pyruvate, 0.01 mg/ml human 

1 0 transferrin and 1 0% fetal bovine serum. After 48 hours of incubation the uptake of 

oleate by enterocytes was measured over a 5 minute time interval. Measurements were 
done in quadruplicate. The uptake assay was done in Hank's buffered salt solution with 
10 mM taurocholate. Only the enterocytes given FATP4 antisense oligonucleotide 
showed a concentration dependent decrease of fatty acid uptake, inhibiting it at a 100 

L5 \xM concentration by -50%. This effect was FATP4 specific, since only the antisense 
oligonucleotide which can bind to the FATP4 mRNA and block its translation inhibited 
uptake, but not a control oligonucleotide differing only in the sequence but not the 
nucleotide content, ruling out a toxic or otherwise nonspecific inhibitory effect of this 
oligonucleotide due to its chemical composition. 

2 0 As a further control experiment, the uptake of oleate was measured along with 

the uptake of methionine in the same cultured enterocytes. Antisense oligonucleotide, 
mismatch sequence oligonucleotide, or no oligonucleotide was added to a concentration 
of 100 |iM to cultures of enterocytes. After incubation for 48 hours, the uptake of both 
3 H-labeled oleate and 35 S-labeled methionine was assayed. Results are shown in Figure 

2 5 .42. Fatty acid uptake is at the left side of the paired bars; methionine uptake is on the 
right side of the paired bars. The fact that amino acid uptake was not influenced by the 
antisense oligonucleotide treatment further supports the conclusion that the antisense 
oligonucleotide causes a specific reduction in translation of FATP4-specific mRNA. 
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Example 14: mmFATP2 Is Expressed in Proximal Renal Tubule Epithelium 

Northern analysis showed that mmFATPl, mmFATP2, and mmFATP4 are 
present in the kidney. In situ hybridization (methods as for Example 6) was performed 
to determine which cell type(s) of the kidney these mRNAs are expressed in. 
5 mmFATP 1 mRNA was present in virtually all cells throughout the kidney with no 
obvious preference for a particular cell type, In contrast, inmFATP2 was expressed 
only in the renal cortex. Within the cortex, expression of mmFATP2 was restricted to 
the epithelial cells of the proximal renal tubules. The primary function of proximal 
renal tubule cells is the reabsorption of filtered salts and nutrients (e.g., glucose), a 

1 0 process that requires mitochondrial oxidation and that can utilize fatty acids as energy 
substrates. Based on the localization of mmFATP2, it is possible that nimFATP2 is 
important for reabsorption in the kidney by allowing uptake of an energy source (fatty 
acids) from the blood into renal epithelial cells. Alternatively, if fatty acids need to be 
reabsorbed in the kidney, similarly to glucose, FATP2 could be involved in the 

1 5 reabsorption of fatty acids. Determination of the subcellular localization of FATP2 will 
distinguish between these two possibilities. 

Table 5 summarizes data on expression of the mouse FATPs in various organs. 
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Example 1 5 : Isolation of full-length human FATP3 

Full-length clones encoding human FATP3 were identified by searching 
databases for sequences similar to the murine FATP1-5 coding regions using the BlastX 
algorithm (Altschul et aL 9 J. MoL Biol 215: 403-410, 1990). Human clones with 
5 similarity to the 5 1 end of murine FATP sequences were sequenced completely. A clone 
encoding full-length human FATP3 was obtained from a human bone library 
constructed in the mammalian expression vector pMET7 (Tartaglia, L.A, et al, Cell 83: 
1263-1271, 1995). To identify human cDNA clones encoding F ATP . family members, 
databases were searched for sequences similar to murine FATP 1-5 coding regions. One 

10 clone was found to encode the human ortholog of mmFATP3 and was designated 

hsFATP3. The DNA and predicted protein sequences of hsFATP3 are shown in Figures 
94A and 94B. hsFATPS is predicted to encode a 703 amino acid 75.6 kD protein with 
multiple membrane-spanning domains. A comparison of the DNA sequences of mouse 
and human FATP3 shows that the mouse and human orthologs are 81% identical to 

15 each other within the coding region. At the amino acid level, hsFATP3 is -86% 
identical to mm FATP3 within the coding region. The sequence identities between 
mouse and human FATP3 are considerably higher than those observed between 
different FATP family members within one species (-40%) and are present in the N- 
terminal part of the protein, a region that is poorly conserved between different FATP 

2 0 family members. 

All references cited herein are incorporated by reference in their entirety. 
While this invention has been particularly shown and described with references 
to preferred embodiments thereof, it will be understood by those skilled in the art that 
various changes in form and details may be made therein without departing from the 

2 5 spirit and scope of the invention as defined by the appended claims. 
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CLAIMS 



What is claimed is: 



1 . An isolated nucleic acid molecule comprising a nucleotide sequence selected 
from the group consisting of; 

5 a) a nucleotide sequence which encodes a protein consisting of the amino 

acid sequence of FATP2 in SEQ ID NO;49; 

b) a nucleotide sequence which encodes a protein consisting of the amino 
acid sequence of FATP4 in SEQ ID NO:53; and 

c) a nucleotide sequence which encodes a protein consisting of the amino 
1 o acid sequence of FATP6 in SEQ ID NO:57. 

2. An isolated nucleic acid molecule comprising a nucleotide sequence selected 
from the group consisting of: 

a) the nucleotide sequence in SEQ ID NO:48; 

b) the nucleotide sequence in SEQ ID NO:52; and 
15 c) the nucleotide sequence in SEQ ID NO:56, 

3 . An isolated nucleic acid molecule comprising a nucleotide sequence selected 
from the group consisting of: 

a) a nucleotide sequence which is complementary to the nucleotide 
sequence of FATP2 in SEQ ID NO:48; 
20 b) a nucleotide sequence which is complementary to the nucleotide 

sequence of FATP4 in SEQ ID NO : 52; and 

c) a nucleotide sequence which is complementary to the nucleotide 
sequence of FATP6 in SEQ ID NO:56. 
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4. An isolated nucleic acid molecule comprising a nucleotide sequence selected 
from the group consisting of: 

a) a nucleotide sequence which consists of the coding region of FATP2; 

b) a nucleotide sequence which consists of the coding region of FATP4; 
and 

c) a nucleotide sequence which consists of the coding region of FATP6. 



5. An isolated nucleic acid molecule comprising at least 15 contiguous nucleotides 
of a nucleotide sequence selected from the group consisting of : 

a) SEQ ID NO:48, or of the complement thereof; 
10 b) SEQ ID NO:52 ? or of the complement thereof; and 
c) SEQ ID NO:56, or of the complement thereof. 

6. An isolated nucleic acid molecule comprising a nucleotide sequence which 
encodes a contiguous portion of at least about 15 amino acids of a sequence 
selected from the group consisting of SEQ ED NO:48, SEQ ID NO:52, and SEQ 

15 IDNO:56. 

7. An isolated nucleic acid molecule comprising a nucleotide sequence which 
encodes a naturally occuning allelic variant of a polypeptide consisting of the 
amino acid sequence of a fatty acid transport protein, wherein said nucleic acid 
molecule hybridizes under high stringency conditions to a complement of a 

2 0 sequence selected from the group consisting of SEQ ID NO:48, SEQ ID NO:52, 

and SEQ ED NO:56. 



An isolated nucleic acid molecule having at least 90% nucleotide sequence 
identity to a nucleic acid encoding a polypeptide comprising an amino acid 
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sequence selected from the group consisting of SEQ ID NO:49, SEQ ID NO:53, 
and SEQIDNO:57. 

9. An isolated nucleic acid molecule comprising a nucleotide sequence encoding a . 
polypeptide, wherein said nucleotide sequence is at least 95% similar to the 

5 nucleotide sequence of a nucleotide sequence selected from the group consisting 

of SEQ ID NO:48, SEQ ID NO:52, and SEQ ID NO:56. 

10. An isolated nucleic acid encoding a fatty acid transport protein having an amino 
acid sequence sharing at least about 95% amino acid sequence similarity with an 
amino acid sequence selected from the group consisting of SEQ ID NO:49, SEQ 

10 ID NO:53> and SEQ ID NO:57. 

11. An isolated nucleic acid molecule encoding a fusion polypeptide, said nucleic 
acid molecule comprising a nucleotide sequence encoding a portion of an amino 
acid sequence selected from the group consisting of SEQ ID NO;49, SEQ ID 
NO:53> and SEQ ID NO:57, and further comprising a nucleotide sequence 

15 encoding a heterologous portion of said fusion polypeptide. 

12. A vector comprising a nucleic acid comprising a nucleotide sequence selected 
from the group consisting of: 

a) a nucleotide sequence which encodes a protein comprising the ammo 

acid sequence of FATP2 in SEQ ID NO:49; 
2 0 b) a nucleotide sequence which encodes a protein comprising the amino 

acid sequence of FATP4 in SEQ ID NO:53; and 
c) a nucleotide sequence which encodes a protein comprising the amino 

acid sequence of FATP6 in SEQ ID NO:57. 
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13. A vector comprising a nucleic acid comprising a nucleotide sequence selected 
from the group consisting of: 

a) the nucleotide sequence of FATP2 in SEQ ID NO:48; 

b) the nucleotide sequence of FATP4 in SEQ ID NO:52; and 
5 c) the nucleotide sequence of FATP6 in SEQ ID NO:56. 



14. A vector comprising a nucleic acid comprising a nucleotide sequence selected 
from the group consisting of: 

a) a nucleotide sequence which is complementary to the nucleotide 

sequence of FATP2 in SEQ ID NO:48; 
10 b) a nucleotide sequence which is complementary to the nucleotide 

sequence of F ATP 4 in SEQ ID NO:52; and 
c) a nucleotide sequence which is complementary to the nucleotide 

sequence of FATP6 in SEQ ID NO:56. 

15. A vector comprising a nucleic acid comprising a nucleotide sequence selected 
1 5 from the group consisting of: 

a) a nucleotide sequence which consists of the coding region of FATP2; 

b) a nucleotide sequence which consists of the coding region of FATP4; 
and 

c) a nucleotide sequence which consists of the coding region of FATP6. 



2 0 16. A host cell comprising the vector of Claim 15. 

17. An isolated nucleic acid molecule comprising a nucleotide sequence which 

encodes a naturally occurring allelic variant of a polypeptide consisting of the 
amino acid sequence of a fatty acid transport protein, wherein said nucleic acid 
molecule hybridizes under high stringency conditions to a complement of a 
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nucleic acid molecule consisting of a sequence selected from the group 
consisting of SEQ ID NO:48, SEQ ID NO.:52, and SEQ ID NO: 56. 

18. A vector comprising a nucleic acid comprising a nucleotide sequence which 
encodes a naturally occurring allelic variant of a polypeptide consisting of the 

5 amino acid sequence of a fatty acid transport protein, wherein said nucleic acid 

molecule hybridizes under high stringency conditions to a complement of a 
nucleic acid molecule consisting of a sequence selected from the group 
consisting of SEQ ID NO:48, SEQ ID NO:52, and SEQ ID NO:56. 

19. A host cell comprising the vector of Claim 8. 

10 20. A method for producing a polypeptide which is a naturally occurring allelic 
variant of a polypeptide consisting of the amino acid sequence of a fatty acid 
transport protein, said method comprising culturing the host cell of Claim 19 
under conditions in which the nucleic acid molecule is expressed, thereby 
producing the polypeptide. 

15 21. A vector comprising a nucleic acid having at least 90% nucleotide sequence 
identity to a nucleic acid encoding a polypeptide consisting of an amino acid 
sequence selected from the group consisting of SEQ ID NO:49 s SEQ ID NO:53, 
and SEQ ID NO:57. 

22. A host cell comprising the vector of Claim 21 . 



2 0 23. 



A method for producing a polypeptide, said method comprising culturing the 
host cell of Claim 22 under conditions in which the nucleic acid molecule is 
expressed, thereby producing the polypeptide. 
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24. A vector comprising a nucleic acid encoding a fatty acid transport protein having 
an amino acid sequence sharing at least about 95% amino acid sequence 
similarity with an amino acid sequence selected from the group consisting of 
SEQ ID NO:49, SEQ ID NO:53, and SEQ ID NO:57. 

5 25 . A host cell comprising the vector of Claim 24. 

26. A method for producing a fatty acid. transport protein, said method comprising 
culturing the host cell of Claim 25 under conditions in which the nucleic acid 
molecule is expressed,. thereby producing the fatty acid transport protein. 

27. A vector comprising a nucleic acid encoding a fusion polypeptide, said nucleic 
1 o acid comprising a nucleotide sequence which encodes a contiguous portion of at 

least about 15 amino acids of a sequence selected from the group consisting of 
SEQ ID NO:49, SEQ ID NO:53, and SEQ ID NO:57. 

28. A host cell comprising the vector of Claim 27. 

29. A method for producing a fusion polypeptide, said method comprising culturing 
15 the host cell of Claim 28 under conditions in which the nucleic acid is expressed, 

thereby producing the fusion polypeptide. 



An isolated nucleic acid molecule comprising a nucleotide sequence selected 
from the group consisting of: 

a) a nucleotide sequence which encodes a protein consisting of the amino 
acid sequence of FATP1 in SEQ ID NO:47; 

b) a nucleotide sequence which encodes a protein consisting of the amino 
acid sequence of FATP3 in SEQ ID NO:51; and 
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c) a nucleotide sequence which encodes a protein consisting of the amino . 
acid sequence of FATP5 in SEQ ID NO:102. 

3 1 . An isolated nucleic acid molecule comprising a nucleotide sequence selected 
from the group consisting of; 

5 a) the nucleotide sequence in SEQ ID NO:46; 

b) the nucleotide sequence in SEQ ID NO:50; and 

c) the nucleotide sequence in SEQ ID NO : 1 0 L 

32. An isolated nucleic acid molecule comprising a nucleotide sequence selected 
from the group consisting of: 

10 a) a nucleotide sequence which is complementary to the nucleotide 

sequence of FATP1 in SEQ ID NO:46; 

b) a nucleotide sequence which is complementary to the nucleotide 
sequence of FATP3 in SEQ ID NO:50; and 

c) a nucleotide sequence which is complementary to the nucleotide 
15 sequence of FATP5 in SEQ ID NO:10L 

33. An isolated nucleic acid molecule comprising a nucleotide sequence selected 
from the group consisting of: 

a) a nucleotide sequence which consists of the coding region of FATP1 ; 

b) a nucleotide sequence which consists of the coding region of FATP3; 
2 0 and 

c) a nucleotide sequence which consists of. the coding region of FATP5. 

34. An isolated nucleic acid molecule comprising at least 15 contiguous nucleotides 
of a nucleotide sequence selected from the group consisting of: 

a) SEQ ID NO:46, or of the complement thereof; 
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b) SEQ ID NO:50, or of the complement thereof; and 

c) SEQ ID NO : 101, or of the complement thereof. 

35 . An isolated nucleic acid molecule comprising a nucleotide sequence which 

' encodes a contiguous portion of at least about 1 5 amino acids of a sequence 
5 selected from the group consisting of SEQ ID NO:47, SEQ ID NO :5 1 , and SEQ 

ID NO: 102. 

36. An isolated nucleic acid molecule comprising a nucleotide sequence which 
encodes a naturally occurring allelic variant of a polypeptide consisting of the 
amino acid sequence of a fatty acid transport protein, wherein said nucleic acid 

10 molecule hybridizes under high stringency conditions to a complement of a 

sequence selected from the group consisting of SEQ ID NO:46, SEQ ID NO:50, 
andSEQIDNO:101. 

37. An isolated nucleic acid molecule having at least 90% nucleotide sequence 
identity to a nucleic acid encoding a polypeptide comprising an amino acid 

15 sequence selected from the group consisting of SEQ ID NO:47, SEQ ID NO:51, 

and SEQ ID NO: 102. 

38. An isolated nucleic acid molecule comprising a nucleotide sequence encoding a 
polypeptide, wherein said nucleotide sequence is at least 90% identical to the 
nucleotide sequence of a nucleotide sequence selected form the group consisting 

20 of SEQ ID NO:46, SEQ ID NO:50, and SEQ ID NO: 101, and wherein said 

percent identity is calculated using the GAP program in the GCG software 
package, using a gap weight of 5.000 and a length weight of 0. 100. 
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39. An isolated nucleic acid encoding a fatty acid transport protein having an amino 
acid sequence sharing at least about 95% amino acid sequence similarity with an 
amino acid sequence selected from the group consisting of SEQ ID NO:47, SEQ 
IDNO:51, and SEQ ID NO: 102. 

5 40. An isolated nucleic acid molecule encoding a fusion polypeptide, said nucleic 

acid molecule comprising a nucleotide sequence encoding a portion of an amino 
acid sequence selected from the group consisting of SEQ ID NO:47, SEQ ID 
NO:51 5 and SEQ ID NO;102, and further comprising a nucleotide sequence 
encoding a heterologous portion of said fusion polypeptide. 

10 41 . A vector comprising a nucleic acid comprising a nucleotide sequence selected 
from the group consisting of: 

a) a nucleotide sequence which encodes a protein comprising the amino 
acid sequence of FATP1 in SEQ ID NO:47; 

b) a nucleotide sequence which encodes a protein comprising the amino 
15 acid sequence of FATP3 in SEQ ID NO:51; and 

c) a nucleotide sequence which encodes a protein comprising the amino 
acid sequence of FATP5 in SEQ ID NO: 102. 

42. A vector comprising a nucleic acid comprising a nucleotide sequence selected 
from the group consisting of: 

2 0 a) the nucleotide sequence of F ATP 1 in SEQ ID NO:46; 

b) ' the nucleotide sequence of FATP3 in SEQ ID NO:50; and . 

c) the nucleotide sequence of FATP5 in SEQ ID NO: 101 . 

43 . A vector comprising a nucleic acid comprising a nucleotide sequence selected 
from the group consisting of: 
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a) a nucleotide sequence which is complementary to the nucleotide 
' sequence of FATP1 in SEQ ID NO:46; 

b) a nucleotide sequence which is complementary to the nucleotide 
sequence of F ATP 3 in SEQ ID NO:50; and 

5 c) a nucleotide sequence which is complementary to the nucleotide 

sequence of F ATP 5 in SEQ ID NO:101. 

44. A vector comprising a nucleic acid comprising a nucleotide sequence selected 
froin the group consisting of: 

a) a nucleotide sequence which consists of the coding region of FATP1; 
10 b) a nucleotide sequence which consists of the coding region of FATP3 ; 

and 

c) a nucleotide sequence which consists of the coding region of FATP5. 

45 . A host cell comprising the vector of Claim 44. 

46. An isolated nucleic acid molecule comprising a nucleotide sequence which 
15 encodes a naturally occurring allelic variant of a polypeptide consisting of the 

amino acid sequence of a fatty acid transport protein, wherein said nucleic acid 
molecule hybridizes under high stringency conditions to a complement.of a 
nucleic acid molecule consisting of a sequence selected from the group 
consisting of SEQ ID NO:46, SEQ ID NO:5Q, and SEQ ID NO:101. 

2 0 47. A vector comprising a nucleic acid comprising a nucleotide sequence which 
encodes a naturally occurring allelic variant of a polypeptide consisting of the 
amino acid sequence of a fatty acid transport protein, wherein said nucleic acid 
molecule hybridizes under high stringency conditions to a complement of a 
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nucleic acid molecule consisting of a sequence selected from the group 
consisting of SEQ ID NO:46, SEQ ID NO:50, and SEQ ID NO:101 . 

48 . A host cell comprising the vector of Claim 47 . 

49. A method for producing a polypeptide which is a naturally occurring allelic 
variant of a polypeptide consisting of the amino acid sequence of a fatty acid 
transport protein, said method comprising culturing the host cell of Claim 48 
under conditions in which the nucleic acid molecule is expressed, thereby 
producing the polypeptide. 

50. A vector comprising a nucleic acid having at least 90% nucleotide sequence 
identity to a nucleic acid encoding a polypeptide consisting of an amino acid 
sequence selected from the group consisting of SEQ ID NO:47, SEQ ID NO:51 ? 
and SEQ ID NO: 102. 

51. A host cell comprising the vector of Claim 50. 

52. A method for producing a polypeptide, said method comprising culturing the 
host cell of Claim 5 1 under conditions in which the nucleic acid molecule is 
expressed, thereby producing the polypeptide. 

53. A vector comprising a nucleic acid encoding a fatty acid transport protein having 
an amino acid sequence sharing at least about 95% amino acid sequence 
similarity with an amino acid sequence selected from the group consisting, of 
SEQ ID NO:47> SEQ ID NO:51, and SEQ ID NO:102. 

54. A host cell comprising the vector of Claim 53 . 
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55. A method for producing a fatty acid transport protein, said method comprising 
culturing the host cell of Claim 54 under conditions in which the nucleic acid 
molecule is expressed, thereby producing the fatty acid transport protein. 

56. A vector comprising a nucleic acid encoding a fusion polypeptide, said nucleic 
acid comprising the nucleotide sequence which encodes a contiguous portion of 
at least about 15 amino acids of a sequence selected from the group consisting of 
SEQ ID NO:47, SEQIDNO:51, and SEQ ID NO: 102, said nucleic acid further 
comprising a nucleotide sequence encoding a heterologous portion of said fusion 
polypeptide. 

57. A host cell comprising the vector of Claim 56. 

58. A method for producing a fusion polypeptide, said method comprising culturing 
the host cell of Claim 57 under conditions in which the nucleic acid is expressed, 
thereby producing the fusion polypeptide. 

59. Isolated FATP2 or a functional portion thereof. 

60. An isolated polypeptide comprising an amino acid sequence which is at least 
90% identical to the amino acid sequence of SEQ ID NO:49. 

61. An isolated polypeptide comprising an amino acid sequence which is at least 
95% identical to the amino acid sequence of SEQ ID NO:49. 



62. An isolated polypeptide comprising an amino acid sequence which is at least 
97%o identical to the amino acid sequence of SEQ ID NO:49. 
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63. Isolated polypeptide encoded by a nucleic acid molecule comprising a 
nucleotide sequence which encodes a naturally occurring allelic variant of a 
polypeptide consisting of the amino acid sequence of FATP2, wherein said 
nucleic acid molecule hybridizes to a complement of a nucleic acid molecule 

5 consisting of SEQ ID NO:48 under high stringency conditions. 

64. An isolated polypeptide comprising an amino acid sequence in SEQ ID NO:49. 

65. A fusion protein comprising a polypeptide or peptide selected from the group 
consisting of: 

a) a naturally occurring allelic variant of a polypeptide consisting of the 
1 0 amino acid sequence of FATP2 in SEQ ID NO:49; 

b) a polypeptide consisting of an amino acid sequence which is at least 95% 
identical to the amino acid sequence of SEQ ID NO:49; 

c) a polypeptide consisting of an amino acid sequence in SEQ ID NO:49; 
and 

15 d) a peptide comprising a contiguous portion of at least about 15 amino acid 

residues of any of the foregoing. 

66. The fusion protein of Claim 65 wherein the fusion protein transports fatty acids 
' across a cell membrane or an artificial cell membrane system. 

67. The fusion protein of Claim 65, further comprising an affinity ligand. 
2 0 68. Isolated FATP4 or a functional portion thereof. 



69. 



An isolated polypeptide comprising an amino acid sequence which is at least 
90% identical to the amino acid sequence of SEQ ID NO:53. 
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70. An isolated polypeptide comprising an amino acid sequence which is at least 
95% identical to the amino acid sequence of SEQ ID NO:53. 

71. . An isolated polypeptide comprising an amino acid sequence which is at least 

97% identical to the amino acid sequence of SEQ ED NO:53. 

5 72. Isolated polypeptide encoded by a nucleic acid molecule comprising a 

nucleotide sequence which encodes a naturally occurring allelic variant of a 
polypeptide consisting of the amino acid sequence of FATP4, wherein said 
nucleic acid molecule hybridizes to a complement of a nucleic acid molecule 
consisting of SEQ ID NO:52 under high stringency conditions. 

10 73. An isolated polypeptide comprising an amino acid sequence in SEQ ID NO:53, 

74. A fusion protein comprising a polypeptide or peptide selected from the group 
consisting of: 

a) a naturally occurring allelic variant of a polypeptide consisting of the 
amino acid sequence of FATP4 in SEQ ID NO:53; 
15 b) a polypeptide consisting of an amino acid sequence which is at least 95% 

identical to the amino acid sequence of SEQ ID N053; 

c) a polypeptide consisting of an amino acid sequence in SEQ ID NO: 53; 
and 

d) a peptide comprising a contiguous portion of at least about 1 5 amino acid 
2 0 residues of any of the foregoing. 



75. 



The fusion protein of Claim 74 wherein the fusion protein transports fatty acids 
across a cell membrane or an artificial cell membrane system. 
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76. The fusion protein of Claim 74, further comprising an affinity ligand. 

77. Isolated FATP6 or a functional portion thereof. 

78. An isolated polypeptide comprising an amino acid sequence which is at least 
90% identical to the amino acid sequence of SEQ ID NO:57. 

5 79. An isolated polypeptide comprising an amino acid sequence which is at least 
95% id.entical to the amino acid sequence of SEQ ID NO:57. 

80. An isolated polypeptide comprising an amino acid sequence which is at least 
97% identical to the amino acid sequence of SEQ ID NO:57. 

8 1 . Isolated polypeptide encoded by a nucleic acid molecule comprising a 

10 nucleotide sequence which encodes a naturally occurring allelic variant of a 

polypeptide consisting of the amino acid sequence of FATP6, wherein said 
nucleic acid molecule hybridizes to a complement of a nucleic acid molecule 
consisting of SEQ ID NO: 56 under high stringency conditions. 

82. An isolated polypeptide comprising an amino acid sequence in SEQ ID NO:57. 

15 83 . A fusion protein comprising a polypeptide or peptide selected from the group 
consisting of: 

a) a naturally occurring allelic variant of a polypeptide consisting of the 
amino acid sequence of FATP6 in SEQ ID NO:57; 

b) a polypeptide consisting of an amino acid sequence which is at least 95% 
2 0 identical to the amino acid sequence of SEQ ID N057; 
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c) a polypeptide consisting of an amino acid sequence in SEQ ID NO: 57; 
and 

d) a peptide comprising a contiguous portion of at least about 1 5 amino acid 
residues of any of the foregoing, 

5 84. The fusion protein of Claim 83 wherein the fusion protein transports fatty acids 
across a cell membrane or an artificial cell membrane system. 



85. The fusion protein of Claim 83, further comprising an affinity ligand. 

86. A method for identifying an agent which binds to a protein comprising an amino 
acid sequence of SEQ ID NO:49 or SEQ ID NO:53, comprising the steps of 

1 0 contacting the agent with the isolated protein under conditions appropriate for 

binding of the agent to the isolated protein, and detecting a resulting agent- 
protein complex. 

87. The method of Claim 86 wherein the step of contacting the agent with isolated 
protein is performed in an artificial membrane system. 

15 88. The method of Claim 86 wherein the isolated protein is in isolated plasma 
membrane. 

A method for identifying an agent which inhibits interaction between an. isolated 
protein comprising amino acid sequence SEQ ID NO:49, or SEQ ID NO:53, and 
further comprising a ligand of said protein, comprising: 
(a) combining: 

(1) said isolated protein; 

(2) the ligand of said protein; and 



89. 
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(3) a candidate agent to be assessed for its ability to inhibit 

interaction .between said protein of (1) and the ligand of (2), 
under conditions appropriate for interaction between the said 
protein of (1) and the ligand of (2); 

(b) determining the extent to which said protein of (1) and the ligand of (2) - 
interact; and 

(c) comparing the extent determined in (b) with the extent to which 
interaction of said protein of (1) and the ligand of (2) occurs in the 
absence of the candidate agent to be assessed and tinder the same 
conditions appropriate for interaction of said protein of (1) with the 
ligand of (2); 

wherein if the extent to which interaction of said protein of (1) a:ad the ligand of 
(2) occurs is less in the presence of the candidate agent than in the absence of the 
candidate agent, the candidate agent is an agent which inhibits interaction 
between said protein and the ligand of said protein. 

90. The method of Claim 89 wherein (a) is performed in an artificial membrane 
system. 

91 . The method of Claim 89 wherein said isolated protein is in isolated plasma 
membrane. 

2 0 92. A method for identifying an agent which binds to a protein, said protein encoded 
by (1) a polynucleotide comprising a nucleotide sequence which encodes a 
naturally occurring allelic variant of a polypeptide consisting of the amino acid 
sequence of FATP2, wherein said polynucleotide hybridizes to a complement of 
a polynucleotide consisting of SEQ ID NO: 48 under high stringency conditions, 

2 5 or by (2) a polynucleotide comprising a nucleotide sequence which encodes a 
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natuxally occurring allelic variant of a polypeptide consisting of the amino acid 
sequence of FATP4, wherein said polynucleotide hybridizes to a complement of 
a polynucleotide consisting of SEQ ID NO: 52 under high stringency conditions, 
comprising the steps of isolating the protein, contacting the agent with the 
5 isolated protein under conditions appropriate for binding of the agent to the 

isolated protein, and detecting a resulting agent-protein complex. 

93 . The method of Claim 92 wherein the step of contacting the agent with the 
isolated protein is performed in an artificial membrane system. 

94. The method of Claim 92 wherein the isolated protein is in isolated plasma 
10 membrane. 

95. A method for identifying an agent which inhibits interaction between (1) an 
isolated protein, said protein being encoded by (i) a polynucleotide comprising a 
nucleotide sequence which encodes a naturally occurring allelic variant of a 
polypeptide consisting of the ammo acid sequence of FATP2, wherein said 

1 5 polynucleotide hybridizes to a complement of a polynucleotide consisting of 

SEQ ID NO:48 under high stringency conditions, or by (ii) a polynucleotide 
having a nucleotide sequence which encodes a naturally occuning allelic variant 
of a polypeptide consiting of the amino acid sequence of FATP4, wherein said 
polynucleotide hybridizes to a complement of a polynucleotide consisting of 

2 0 SEQ ID NO:52 under high stringency conditions and (2) a ligand of said protein, 

comprising: 
(a) combining: 

(1) said isolated protein; 

(2) the ligand of said protein; and 
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(3) a candidate agent to be assessed for its ability to inhibit 

interaction between said protein of (1) and the ligaaid of (2), 
under conditions appropriate for interaction between said protein 
of (1) and the ligand of (2); 

(b) determining the extent to which said protein of (1) and the ligand of (2) . 
interact; and 

(c) comparing the extent determined in (b) with the extent to which 
interaction of said protein of (1) and the ligand of (2) occurs in the 
absence of the candidate agent to be assessed and under die same 
conditions appropriate for interaction of said protein of (1) with the 
ligand of (2); 

wherein if the extent to which interaction of said protein of (1). and the ligand of 
(2) occurs is less in the presence of the candidate agent than in the absence of the 
candidate agent, the candidate agent is an agent which inhibits interaction 
between said protein and the ligand of said protein. 

96. The method of Claim 95 wherein (a) is performed in an artificial membrane 
system. 

97. The method of Claim 95 wherein said isolated protein is in isolated plasma 
membrane. 

2 0 98. A method for identifying an agent which binds to a protein encoded by a nucleic 
acid encoding a fatty acid transport protein comprising an amino acid sequence 
sharing at least about 95% amino acid sequence similarity with the amnio acid 
sequence in SEQ ID NO:49, or SEQ ID NO:53 comprising the steps of isolating 
the protein, contacting the agent with the isolated protein under conditions 
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appropriate for binding of the agent to the isolated protein, and detecting a 
resulting agent-protein complex. 

99. The method of Claim 98 wherein the step of contacting the agent with isolated 
protein is perfoimed in an artificial membrane system. 

5 100. The method of Claim 98 wherein the isolated protein is in isolated plasma 
membrane, 

101 . A method for identifying an agent which inhibits interaction between (i) an 

isolated protein encoded by a nucleic acid encoding a fatty acid transport protein 
comprising an amino acid sequence sharing at least about 90% amino acid 
1 0 sequence similarity with the amino acid sequence in SEQ ID NO:49, or (ii) a 

! ' protein encoded by a nucleic acid encoding a fatty acid transport protein 

comprising an amino acid sequence sharing at least about 90% amino acid 
sequence similarity with the amino acid sequence in SEQ ID NO;53 and a ligand 

t 

of said protein, said method comprising: 
1 5 (a) combining: 

(1) said isolated protein; 

(2) the ligand of said protein; and 

(3) a candidate agent to be assessed for its ability to inhibit 
interaction between said protein of (1) and the ligand of (2), 

2 0 under conditions appropriate for interaction between the said 

. protein of (1) and the ligand of (2); 

(b) determining the extent to which said protein of (1) and the ligand of (2) 
interact; and 

(c) comparing the extent determined in (b) with the extent to which 

2 5 interaction of said protein of (1) and the ligand of (2) occurs in the 
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absence of the candidate agent to be assessed and under the same 
conditions appropriate for interaction of said protein of (1) with the 
ligand of (2); 

wherein if the extent to which interaction of said protein of (1) and the ligand of 
5 (2) occurs is less in the presence of the candidate agent than in the absence of the 

candidate agent, the candidate agent is an agent which inhibits interaction 
between said protein and the ligand of said protein. 

102, The method of Claim 101 wherein (a) is performed in an artificial membrane 
system. 

10 103. The method of Claim 101 wherein said isolated protein is in isolated plasma 
membrane. 

104. A method for identifying an agent which is an inhibitor of fatty acid uptake by 
(i) a protein encoded by a polynucleotide comprising a nucleotide sequence 
which encodes a protein consisting of the amino acid sequence in SEQ ID 
1 5 NO:49, or by (ii) a protein encoded by a polynucleotide comprising a nucleotide 

sequence which encodes a protein consisting of the amino acid sequence in SEQ 
ID NO:53, comprising the steps of: 

a) maintaining test cells expressing said polynucleotide in the presence of a 
fatty acid and an agent to be tested as an inhibitor of fatty acid uptake; 
2 0. b) measuring uptake of the fatty acid in the test cells; and 

c) comparing uptake of the fatty acid in the test cells with up t alee of the 

* fatty acid in suitable control cells; 
wherein lower uptake of the fatty acid in the test cells compared to uptake of the 
fatty acid in the control cells is indicative that the agent is an hihibitor of fatty 
2 5 acid uptake by said protein. 
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1*05. An inhibitor of fatty acid uptake identified by the method of Claim 104. 

106. The method of Claim 104 further comprising the steps of: 

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
5 tissue or bodily fluid from said test animals; 

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; 

d) comparing the fatty acids of b) with the fatty acids of c); 

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
10 an inhibitor of said protein. 

107. An inhibitor of fatty acid uptake identified by the method of Claim 106. 

108. A method for identifying an agent which is an inhibitor of fatty acid uptake by a 
protein, said protein encoded by (i) a polynucleotide comprising a nucleotide 
sequence which encodes a naturally occurring allelic variant of a polypeptide 

15 consisting of the amino acid sequence of FATP2, wherein said polynucleotide 

hybridizes to a complement of a polynucleotide consisting of SEQ ID NO:48 
under high stringency conditions, or by (ii) a polynucleotide comprising a 
nucleotide sequence which encodes a naturally occurring allelic variant of a 
polypeptide consisting of the amino acid sequence of FATP4, wherein said 

2 0 polynucleotide hybridizes to a complement of a polynucleotide consisting of 

SEQ ID NO:52 under high stringency conditions, comprising the steps of: 

a) - maintaining test cells expressing said polynucleotide in the presence of a 

fatty acid and an agent to be tested as an inhibitor of fatty acid uptake; 

b) measuring uptake of the fatty acid in the test cells; and 
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' c) comparing uptake of the fatty acid in the test cells with uptake of the 
fatty acid in suitable control cells; 
wherein lower uptake of the fatty acid in the test cells compared to uptake of the 
fatty acid in the control cells is indicative that the agent is an inhibitor of fatty 
5 acid uptake by said protein. 

109. An inhibitor of fatty acid uptake identified by the method of Claim 108. 

110. The method of Claim 108 further comprising the steps of: 

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
1 0 tissue or bodily fluid from said test animals; 

c) measuring exogenously supplied fatty acids in one or more comparable, 
samples of tissue or bodily fluid from suitable control animals; 

d) comparing the fatty acids of b) with the fatty acids of c); 

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
15 an inhibitor of said protein. 

111. An inhibitor of fatty acid uptake identified by the method of Claim 110. 

112. A method for identifying an agent which is an inhibitor of fatty acid uptake by a 
protein, said protein being encoded by (i) a nucleic acid encoding a fatty acid 
transport protein comprising an amino acid sequence sharing at least about 95% 

. 2 0 amino acid sequence similarity with the amino acid sequence in SEQ ID NO:49 

or by (ii) a nucleic acid encoding a fatty acid transport protein comprising an 
amino acid sequence sharing at least about 95% amino acid sequence similarity 
with the amino acid sequence in SEQ ID NO:53, comprising the steps of: 
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a) maintaining test cells expressing said polynucleotide in the presence of a 
" fatty acid and an agent to be tested as an mhibitor of fatty acid uptake; 

b) measuring uptake of the fatty acid in the test cells; and 

c) comparing uptake of the fatty acid in the test cells with uptake of the 
5 fatty acid in suitable control cells; 

wherein lower uptake of the fatty acid in the test cells compared to uptake of the 
fatty acid in the control cells is indicative that the agent is an inhibitor of fatty 
acid uptake by said protein. 

113. An inhibitor of fatty acid uptake identified by the method of Claim 112. 

10 114. The method of Claim 1 12 further comprising the steps of: 

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from said test animals; 

c) measuring exogenously supplied fatty acids in one or more comparable 
1 5 samples of tissue or bodily fluid from suitable control animals; 

d) comparing the fatty acids of b) with the fatty acids of c); 

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

115. Aii inhibitor of fatty acid uptake identified by the method of Claim 114. 

2 0 116. A method for identifying an agent which is an inhibitor of (i) a protein encoded 
by a polynucleotide comprising a nucleotide sequence which encodes a protein 
comprising the amino acid sequence in SEQ ID NO:49 or (ii) a protein encoded 
by a polynucleotide comprising a nucleotide sequence which encodes a protein 
comprising the amino acid sequence in SEQ ID NO:53, comprising the steps of: 
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(a) introducing into host cells one or more vectors comprising a 
polynucleotide expressing said protein; 

(b) culturing a first aliquot of the host cells with fatty acid substrate of said 
pro tern and with an agent being tested as an inhibitor of said protein; 

5 (c) culturing a second aliquot of the host cells with fatty acid substrate of 

said protein; 

(d) measuring, in the first and second aliquots, uptake of the fatty acid 

substrate of the host cells; 
wherein less uptake of the fatty acid substrate in the first aliquot compared to 
1 0 the second aliquot is indicative that the agent is an inhibitor of said protein. 

117. The method of Claim 116 further comprising the steps of: 

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from suitable control animals; 

15 c) measuring exogenously supplied fatty acids in one or more comparable 

samples of tissue or bodily fluid from suitable control animals; and 
d) comparing the fatty acids of b) with the fatty acids of c); 
whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

A method for identifying an agent which is an inhibitor of a protein, said protein 
being encoded by (i) a polynucleotide comprising a nucleotide sequence which 
encodes a naturally occurring allelic variant of a polypeptide consisting of the 
amino acid sequence of FATP2, wherein said polynucleotide hybridizes to a 
complement of a polynucleotide consisting of SEQ ID NO:48 under high 
stringency conditions, or by (ii) a polynucleotide comprising a nucleotide 
sequence which encodes a naturally occurring allelic variant of a polypeptide 
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corisisting of the amino acid sequence of FATP4, wherein said polynucleotide 
hybridizes to a complement of a polynucleotide consisting of SEQ ID NO:52 
under high stringency conditions, comprising the steps of: 

(a) introducing into host cells one or more vectors comprising a 
5 polynucleotide expressing said protein; 

(b) culturing a first aliquot of the host cells with fatty acid substrate of said 
protein and with an agent being tested as an inhibitor of said protein; 

(c) culturing a second aliquot of the host cells with fatty acid substrate of 
said protein; 

1 0 (d) measuring, in the first and second aliquots, uptake of the fatty acid 

substrate of the host cells; 
wherein less uptake of the fatty acid substrate in the first aliquot compared to 
the second aliquot is indicative that the agent is an inhibitor of said protein. 

119. The method of Claim 118 further comprising the steps of; 
15 a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from suitable control animals; 

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; and 

2 0 d) comparing the fatty acids of b) with the fatty acids of c); 

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

1 20. A method for identifying an agent which is an inhibitor of a protein, said protein 
being encoded by (i) a nucleic acid encoding a fatty acid transport protein 

2 5 comprising an amino acid sequence sharing at least about 95% amino acid 

sequence similarity with the amino acid sequence in SEQ ID NO: 49 or by (ii) a 
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nucleic acid encoding a fatty acid transport protein comprising an amino acid 
sequence sharing at least about 95% amino acid sequence similarity with the 
amino acid sequence in SEQ ID NO:53, comprising the steps of: 

(a) introducing into host cells one or more vectors comprising a 
5 polynucleotide expressing said protein; 

(b) culturing a first aliquot of the host cells with fatty acid substrate of said 
protein and with an agent being tested as an inhibitor of said protein; 

(c) culturing a second aliquot of the host cells with fatty acid substrate of 
said protein; 

10 (d) measuring, in the first and second aliquots, uptake of the fatty acid 

substrate of the host cells; 
wherein less uptake of the fatty acid substrate in the first aliquot compared to the 
second aliquot is indicative that the agent is an inhibitor of said protein. 

121 . The method of Claim 120 further comprising the steps of: 
15 - a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from suitable control animals; 

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; and 

2 0 d) comparing the fatty acids of b) with the fatty acids of c). 

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

122. A method for identifying an agent which binds to a protein comprising an amino 
acid sequence of SEQ ID NO:57, comprising the steps of contacting the agent 

2 5 with the isolated protein under conditions appropriate for binding of the agent to 

the isolated protein, and detecting a resulting agent-protein complex.. 
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123. The method of Claim 122 wherein the step of contacting the agent with isolated 
protein is performed in an artificial membrane system. 

124. The method of Claim 122 wherein the isolated protein is in isolated plasma 
■ membrane. 

5 125. A method for identifying an agent which inhibits interaction between an isolated 
protein comprising an amino acid sequence of SEQ ID NO:57, and further 
comprising a ligand of said protein, comprising: 
(a) combining: 

(1) said isolated protein; 
1 0 (2) the ligand of said protein; and 

(3) . a candidate agent to be assessed for its ability to inhibit 

interaction between said protein of (1) and the ligand of (2), 
.under conditions appropriate for interaction between the said 
protein of (1) and the ligand of (2); 
15 (b) determining the extent to which said protein of (1 ) and the ligand of (2) 

interact; and 

(c) comparing the extent determined in (b) with the extent to which 

interaction of said protein of (1) and the ligand of (2) occurs in the 
absence of the candidate agent to be assessed and under the same 
2 0 conditions appropriate for interaction of said protein of (1) with the 

ligand of (2); 

wherein if the extent to which interaction of said protein of (1) and the ligand of 
(2) occurs is less in the presence of the candidate agent than in the absence of the 
candidate agent, the. candidate agent is an agent which inhibits interaction 
2 5 between said protein and the ligand of said protein. 
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126. The method of Claim 125 wherein (a) is performed in an artificial membrane 
system. 

127. The method of Claim 125 wherein said isolated protein is in isolated plasma 
membrane. 

5 128. A method for identifying an agent which binds to a protein, said protein encoded 
by a polynucleotide comprising a nucleotide sequence which encodes a naturally 
occurring allelic variant of a polypeptide consisting of the amino acid sequence 
of FATP6, wherein said polynucleotide hybridizes to a complement of a 
polynucleotide consisting of SEQ ID NO:56 under high stringency conditions, 
1 0 comprising the steps of isolating the protein, contacting the agent with the 

isolated protein under conditions appropriate for binding of the agent to the 
isolated protein, and detecting a resulting agent-protein complex. 

129. The method of Claim 128 wherein the step of contacting the agent with the 
isolated protein is performed in an artificial membrane system. 

15 130. The method of Claim 128 wherein the isolated protein is in isolated plasma 
membrane. 

131. A method for identifying an agent which inhibits interaction between (1) an 
isolated protein, said protein encoded by a polynucleotide comprising a 
nucleotide sequence which encodes a naturally occurring allelic variant of a 
2 0 polypeptide consisting of the amino acid sequence of F ATP 6, wherein said 

polynucleotide hybridizes to a complement of a polynucleotide consisting of 
SEQ ID NO:56 under high stringency conditions, and (2) a ligand of said 
protein, comprising: 
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(a) combining: 

(1) said isolated protein; 

(2) the ligand of said protein; and 

(3) a candidate agent to be assessed for its ability to inhibit 

5 interaction between said protein of (1) and the ligand of (2), 

under conditions appropriate for interaction between said protein 
of (1) and the ligand of (2); 

(b) determining the extent to which said protein of (1) and the ligand of (2) 
interact; and 

1 o (c) comparing the extent determined in (b) with the extent to which 

interaction of said protein of (1) and the ligand of (2) occurs in the 
absence of the candidate agent to be assessed and under the same 
conditions appropriate for interaction of said protein of (1) with the 
ligand of (2); 

1 5 wherein if the extent to which interaction of said protein of (1) and the ligand of 

(2) occurs is less in the presence of the candidate agent than in the absence of the 
candidate agent, the candidate agent is an agent which inhibits interaction 
between said protein and the ligand of said protein. 

132. The method of Claim 13 1 wherein (a) is performed in an artificial membrane 

2 0 system. 



133. The method of Claim 1 3 1 wherein the isolated protein is in isolated plasma 
membrane. 

134. A method for identifying an agent which binds to a protein encoded by a nucleic 
acid encoding a fatty acid transport protein consisting of an amino acid sequence 

2 5 sharing at least about 95% amino acid sequence similarity with the amino acid 
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sequence in SEQ ID NO:57 comprising the steps of isolating the protein, 
contacting the agent with the isolated protein under conditions appropriate for 
binding of the agent to the isolated protein, and detecting a resulting agent- 
protein complex. 

5 135. The method of Claim 134 wherein the step of contacting the agent -with isolated 
protein is performed in an artificial membrane system. 

136. The method of Claim 134 wherein the isolated protein is in isolated plasma 
membrane. 

137. A method for identifying an agent which inhibits interaction between an isolated 

1 o protein encoded by a nucleic acid encoding a fatty acid transport protein 

comprising an amino acid sequence sharing at least about 90% amino acid 
sequence similarity with the amino acid sequence in SEQ ID NO:57 and a ligand 
of said protein, said method comprising: 

(a) combining: 

15 (1) said isolated protein; 

(2) the ligand of said protein; and 

(3) a candidate agent to be assessed for its ability to inhibit 
interaction between said protein of (1) and the ligand of (2), 
under conditions appropriate for interaction between the said. 

2 0 protein of (1) and the ligand of (2); 

(b) determining the extent to which said protein of (1) and the ligand of (2) 
interact; and 

(c) comparing the extent determined in (b) with the extent to which 
interaction of said protein of (1) and the ligand of (2) occurs in the 

2 5 absence of the candidate agent to be assessed and under the same 



WO 99/36537 



PCT/US99/00182 



-120- S 

conditions appropriate for interaction of said protein of (1) with the 
ligandof(2); 

wherein if the extent to which interaction of said protein of (1) and the ligand of 
(2) occurs is less in the presence of the candidate agent than in the absence of the 
5 candidate agent, the candidate agent is an agent which inhibits interaction 

between said protein and the ligand of said protein. 

138. The method of Claim 137 wherein (a) is performed in an artificial membrane 
system. 

139, The method of Claim 137 wherein said isolated protein is in isolated plasma 
10 membrane. 



140. A method for identifying an agent which is an inhibitor of fatty acid uptake by a 
protein encoded by a polynucleotide comprising a nucleotide sequence which 
encodes a protein consisting of tire amino acid sequence in SEQ ID NO:57, 
comprising the steps of: 
15 a) maintaining test cells expressing said polynucleotide in the presence of a 

fatty acid and an agent to be tested as an inhibitor of fatty acid uptake; 

b) measuring uptake of the fatty acid in the test cells; and 

c) comparing uptake of the fatty acid in the test cells with uptake of die 
fatty acid in suitable control cells; 

2 0 wherein lower uptake of the fatty acid in the test ceils compared to uptake of the 

fatty acid in the control cells is indicative that tire agent is an inhibitor of fatty 
acid uptake by said protein. 



141. 



An inhibitor of fatty acid uptake identified by the method of Claim 140. 
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142. The method of Claim 140 further comprising the steps of: 

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from said test animals; 

.5 c) measuring exogenously supplied fatty acids in one or more comparable 

samples of tissue or bodily fluid from suitable control animals; 
d) comparing the fatty acids of b) with the fatty acids of c); 
whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

10 143. An inhibitor of fatty acid uptake identified by the method of Claim 1 42 . 

144. A method for identifying an agent which is an inhibitor of fatty acid uptake by a 
protein, said protein encoded by a polynucleotide comprising a nucleotide 
sequence which encodes a naturally occurring allelic variant of a polypeptide 
consisting of the amino acid sequence of FATP6, wherein said polynucleotide 
1 5 hybridizes to a complement of a polynucleotide consisting of SEQ ID NO:56 

under high stringency conditions, comprising the steps of: 

a) maintaining test cells expressing said polynucleotide in the presence of a 
fatty acid and an agent to be tested as an inhibitor of fatty acid uptake; 

b) measuring uptake of the fatty acid in the test cells; and 

2 0 c) comparing uptake of the fatty acid in the test cells with uptake of the 

fatty acid in suitable, control cells; 
wherein lower uptake of the fatty acid in the test cells compared to uptake "of the 
fatty acid in the control cells is indicative that the agent is an inhibitor of fatty 
acid uptake by said protein. 



25 145. 



An inhibitor of fatty acid uptake identified by the method of Claim 144. 
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146. The method of Claim 144 further comprising the steps of: 

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from said test animals; 

5 c) measuring exogenously supplied fatty acids in one or more comparable 

samples of tissue or bodily fluid from suitable control animals; 
d) comparing the fatty acids of b) with the fatty acids of c); 
whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

10 1 47. An inhibitor of fatty acid uptake identified by the method of Claim. 146. 

148. A method for identifying an agent which is an inhibitor of fatty acid uptake by a 
protein, said protein being encoded by a nucleic acid encoding a fatty acid 
transport protein comprising an amino acid sequence sharing at least about 95% 
amino acid sequence similarity with the amino acid sequence in SEQ ID NO: 57, 
15 comprising the steps of: 

a) maintaining test cells expressing said polynucleotide in the presence of a 
fatty acid, and an agent to be tested as an inhibitor of fatty acid uptake; 

b) measuring uptake of the fatty acid hi the test cells; and 

c) comparing uptake of the fatty acid in the test cells with uptake of the 
2 0 fatty acid in suitable control cells; , 

wherein lower uptake of the fatty acid ha the test cells compared to uptake of the 
fatty acid in the control cells is indicative that the agent is an inhibitor of fatty 
acid uptake by said protein. 



149. 



An inhibitor of fatty acid uptake identified by the method of Claim 148. 
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150. The method of Claim 148 further comprising the steps of: 

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from said test animals; 

5 c) measuring exogenously supplied fatty acids in one or more comparable 

samples of tissue or bodily fluid from suitable control animals; 
d) comparing the fatty acids of b) with the fatty acids of c); 
whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

10 151, An inhibitor of fatty acid uptake identified by the method of Claim 150. 

1 52. A method for identifying an agent which is an inhibitor of a protein encoded by 
a polynucleotide comprising a nucleotide sequence which encodes a protein 
comprising the amino acid sequence in SEQ ED NO:57, comprising the steps of: 

(a) introducing into host cells one or more vectors comprising a 
1 5 polynucleotide expressing said protein; 

(b) culturing a first aliquot of the host cells with fatty acid substrate of said 
protein and with an agent being tested as an inhibitor of said protein; 

(c) culturing a second aliquot of the host cells with fatty acid substrate of 
said protein; . 

2 0 (d) measuring, in the first and second aliquots, uptake of the fatty acid 

substrate of the host cells; 
wherein less uptake of the fatty acid substrate in the first aliquot compared to 
the second aliquot is indicative that the agent is an inhibitor of said protein. 



153. 

25 



The method of Claim 152 further comprising the steps of: 
a) administering the agent to one or more test animals; 
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b) measuiiiig exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from suitable control animals; 

c) measuring exogenously .supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; and 

5 - d) comparing the fatty acids of b) with the fatty acids of c); 

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

A method for identifying an agent which is an inhibitor of a protein, said proteiri 
being encoded by a polynucleotide comprising a nucleotide sequence which 
encodes a naturally occurring allelic variant of a polypeptide consisting of the 
amino acid sequence of FATP6, wherein said polynucleotide hybridizes to a 
complement of a polynucleotide consisting of SEQ ID NO:56 under high 
stringency conditions, comprising the steps of: 

(a) introducing into host cells one or more vectors comprising a 
polynucleotide expressing said protein; 

(b) culturing a first aliquot of the host cells with fatty acid substrate of said 
protein, and with an agent being tested as an inhibitor of said protein; 

(c) culturing a second aliquot of the host cells with fatty acid substrate of 
said protein; 

(d) measuring, in the first and second aliquots, uptake of the fatty acid 
substrate of the host cells; 

wherein less uptake of the fatty acid substrate in the first aliquot compared to 
the second aliquot is indicative that the agent is an inhibitor of said protein. 

155. The method of Claim 154 further comprising the steps of: 
2 5 a) administering the agent to one or more test animals; 



154. 

10 
15 
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b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from suitable control animals; 

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; and 

5 d) comparing the fatty acids of b) with the fatty acids of c); 

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

156. A method for identifying an agent which is an inhibitor of a protein, said protein 
being encoded by a nucleic acid encoding a fatty acid transport protein 
comprising an amino acid sequence sharing at least about 95% amino acid 
sequence similarity with the amino acid sequence in SEQ ID NO:57, comprising 
the steps of: 

(a) introducing into host cells one or more vectors comprising a 
polynucleotide expressing said protein; 

(b) culturin'g a first aliquot of the host cells with fatty acid substrate of said 
protein and with an agent being tested as an inhibitor of said protein; 

(c) culturing a second aliquot of the host cells with fatty acid substrate of 
said protein; 

(d) measuring, in the first and second aliquots, uptake of the fatty acid 
substrate of the host cells; 

wherein less uptake of the fatty acid substrate in the first aliquot compared to the 
second aliquot is indicative that the agent is an inhibitor of said protein. 

157. The method of Claim 156 further comprising the steps of: 
a) administering the agent to one or more test animals; 

2 5 b) measuring exogenously supplied fatty acids in one or more samples of 

tissue or bodily fluid from suitable control animals; 
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c) measuring exogcnously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; and 

d) comparing the fatty acids of b) with the fatty acids of c) . 

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
5 an inhibitor of said protein. 

158. A method for identifying an agent which is an inhibitor of a fatty acid transport 
protein, comprising the steps of: 

(a) introducing into cells one or more vectors comprising a gene encoding a 
cell surface protein and a nucleic acid encoding the fatty acid transport 

10 protein; 

(b) contacting the host cells with anti-cell surface protein antibody and 
labeled fatty acid substrate of the fatty acid transport protein; 

(c) contacting a first aliquot of the host cells with an agent being tested as an 
inhibitor of the fatty acid transport protein, while leaving a second 

1 5 aliquot of the host cells uncontacted with the agent; 

(d) identifying, in the first and second aliquots, the host cells expressing the 
cell surface protein by detecting the anti-cell surface protein antibody 
bound to the host cells; and 

(e) measuring, in the first and second aliquots, uptake of the fatty acid 
2 o substrate of the host cells identified as expressing the cell surface 

protein; 

wherein less uptake of the fatty acid substrate in the first aliquot compared to the 
second aliquot is' indicative that the agent is an inhibitor of the fatty acid 
transport protein. 



2 5 159. The method of Claim 158 wherein the host cells regulably express the FATP4 
gene. 
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160. The method of Claim 158 wherein the host cells are prokaryotes. 

161. The method of Claim 158 wherein the prokaryotes are E. coli. 

1 62 . The method of Claim 158 wherein the fatty acid is a radioactively labeled fatty 
acid. 

A method for identifying an agent which is an inhibitor of FATP4, comprising 
the steps of: 

(a) introducing into cells one or more vectors comprising a gene encoding a 
cell surface protein and a nucleic acid encoding FATP4; 

(b) contacting the host cells with anti-cell surface protein antibody and 
labeled fatty acid substrate of FATP4; 

(c) contacting a first aliquot of the host cells with, an agent being tested as an 
inhibitor of FATP4, while leaving a second aliquot of the host cells 
uncontacted with the agent; 

(d) identifying, in the first and second aliquots, the host cells expressing the 
cell surface protein by detecting the anti-cell surface protein antibody 
bound to the host cells; and 

(e) measuring, in the first and second aliquots, uptake of the fatty acid 
substrate of the host cells identified as expressing the cell surface 
protein; 

wherein less uptake of the fatty acid substrate in the first aliquot compared to the 
second aliquot is indicative that the agent is an inhibitor of FATP4. 

164. The method of Claim 163 further comprising the steps of: 
a) administering the agent to one or more test animals; 



5 163. 
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b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from suitable control animals; 

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; and 

5 d) comparing the fatty acids of b) with the fatty acids of c); 

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

165. The method of Claim 163 wherein the cell surface protein is CD2, 

166. The method of Claim 163 wherein the fatty acid substrate is BODIPY-labeled. 

10 1 67. A method for detecting, in a sample of cells, a nucleic acid molecule comprising 
at least about 90% sequence similarity to SEQ ID NO:48, comprising: 

a) purifying nucleic acid from the cells; 

b) hybridizing 1) purified nucleic acid from the cells to 2) purified nucleic 
acid comprising SEQ ID NO:48, under conditions that allow 

1 5 hybridization between 1) and 2) if the sequences of 1 ) and 2) have at 

lea.st about 90% sequence similarity; and 

c) detecting resulting hybrid, nucleic acids in the hybridization; wherein, if 
hybrid nucleic acids are detected at a significant level compared to a 
suitable control hybridization, then a nucleic acid molecule comprising at 

2 0 least about 90% sequence similarity to SEQ ID NO:48, has been 

detected. 



168. 



A method for detecting, in a sample of purified nucleic acid, a nucleic acid 
molecule having at least about 90% sequence similarity to SEQ ID NO:48, 
comprising: 
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a) hybridizing 1 ) the sample of purified nucleic acid to 2) purified nucleic 
acid comprising SEQ ID NO:48, under conditions that allow 
hybridization between 1) and 2) if the sequences of 1) and 2) have at 
least about 90% sequence similarity; and 
5 b) detecting resulting hybrid nucleic acids in the hybridization; wherein, if 

hybrid nucleic acids are detected at a significant level compared to a 
suitable control hybridization, then a nucleic acid molecule comprising at 
least about 90% sequence similarity to SEQ ID NO:48, has been 
detected. 

10 169. A method for identifying (1) nucleic acid molecules in fixed cells which 

specifically interact with a (2) nucleic acid molecule comprising the nucleotide 
sequence in SEQ ID NO:48, said method comprising the steps of: 
a) adding to the fixed cells die nucleic acid molecule comprising a 
nucleotide sequence in SEQ ID NO:48; 
15 b) incubating the fixed cells under conditions allowing hybridization of (1) 

with (2); 

c) removing the nucleic acid molecule of step a) that has not hybridized; 
and 

d) detecting hybrid molecules comprising (1) and (2). 

2 0 170. - A method for detecting FATP2 in a sample of cells, comprising the steps of 
adding an agent that specifically binds to FATP2 to the sample, and detecting 
agent specifically bound to the FATP2. 



171. 



The method of Claim 170 wherein the agent is an antibody which binds to 
FATP2. 
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172. A method for detecting FATP2 in a sample of cell lysate, comprising the steps 
of adding an agent that specifically binds to FATP2 to the sample, and detecting 
agent specifically bound to the FATP2. 

173. The method of Claim 172 wherein the agent is an antibody which binds to 
5 FATP2. 

1 74. An isolated antibody which binds to a polypeptide having an amino acid 
sequence sharing at least about 95% amino acid sequence similarity with the 
amino acid sequence hi SEQ ID NO:49. 

1 75. An isolated antibody which binds to a fatty acid transport protein having the 
1 0 amino acid sequence in SEQ ID NO:49. 

176. A method for detecting, in a sample of cells, a nucleic acid molecule comprising 
at least about 90% sequence similarity to SEQ ID NO:52, comprising: 

a) purifying nucleic acid from the cells; 

b) hybridizing 1) purified nucleic acid from the cells to 2) purified nucleic 
1 5 acid comprising SEQ ID NO:52, under conditions that allow 

hybridization between 1) and 2) if the sequences of 1) and 2) have at 
least about 90% sequence similarity; and 

c) detecting resulting hybrid nucleic acids in the hybridization; wherein, if 
hybrid nucleic acids are detected at a significant level compared to a 

2 0 suitable control hybridization, then a nucleic acid molecule having at 

least about 90% sequence similarity to SEQ ID NO:52, has been 
detected. 
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177. A method for detecting, in a sample of purified nucleic acid, a nucleic acid 
molecule comprising at least about 90% sequence similarity to SEQ ID NO:52, 
comprising: 

a) hybridizing 1) the sample of purified nucleic acid to 2) purified nucleic 
5 acid comprising SEQ ID NO:52, under conditions that allow 

hybridization between 1) and 2) if the sequences of 1) and 2) have at 
least about 90% sequence similarity; and 

b) detecting resulting hybrid nucleic acids in the hybridization; wherein, if 
hybrid nucleic acids are detected at a significant level compared to a 

10 suitable control hybridization, then a nucleic acid molecule having at . 

least about 90% sequence similarity to SEQ ID NO:52, has been 
detected. 

178, A method for identifying (1) nucleic acid molecules in fixed cells which 
specifically interact with a (2) nucleic acid molecule comprising the nucleotide 

15 sequence in SEQ ID NO:52, said method comprising the steps of: 

a) adding to the fixed cells the (2) nucleic acid molecule comprising a 
nucleotide sequence in SEQ ID NO:52; 

b) incubating the fixed cells under conditions allowing hybridization of (1) 
with (2); 

2 0 c) removing the nucleic acid molecule of step a) that has not hybridized; 

and 

d) detecting hybrid molecules comprising (1) and (2). 



25 



179, 



A method for detecting FATP4 in a sample of cells, comprising the steps of 
adding an agent that specifically binds to FATP4 to the sample, and detecting 
agent specifically bound to the FATP4. 
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1 80. The method of Claim 179 wherein the agent is an antibody which binds to 
FATP4. 

181. A method for detecting FATP4 in a sample of cell lysate, comprising the steps 
of adding an agent that specifically binds to FATP4 to the sample, and detecting 

5 agent specifically bound to the FATP4, 

1 82. The method of Claim 1 8 1 wherein the agent is an antibody which binds to 
FATP4. 

1 83. An isolated antibody which binds to a polypeptide having an ammo acid 
sequence sharing at least about 95% amino acid sequence similarity with the 

1 0 amino acid sequence in SEQ ID NO; 53. 

1 84. An isolated antibody which binds to a fatty acid transport protein having the 
amino acid sequence in SEQ ED NO:53. 

185. A method for detecting, in a sample of cells, a nucleic acid molecule comprising 
at least about 90% sequence similarity to SEQ ID NO:56, comprising: 

1 5 a) purifying nucleic acid from the cells; 

b) hybridizing 1) purified nucleic acid from the cells to 2) purified nucleic 
acid comprising SEQ ID NO:56, under conditions that allow 
hybridization between 1) and 2) if the sequences of 1) and 2) have at 
least about 90% sequence similarity; and 

2 0 c) detecting resulting hybrid nucleic acids in the hybridization; wherein, if 

hybrid nucleic acids are detected at a significant level compared to a 
suitable control hybridization, then a nucleic acid molecule having at 
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least about 90% sequence similarity to SEQ ID NO:56 5 has been 
detected. 

1 86. A method for detecting, in a sample of purified nucleic acid, a nucleic acid 

molecule comprising at least about 90% sequence similarity to SBQ ID NO:56, 
5 comprising: 

a) hybridizing 1) the sample of purified nucleic acid to 2) purified nucleic 
acid comprising SEQ ID NO:56 under conditions that allow 
hybridization between 1) and 2) if the sequences of 1) and 2) have at 
least about 90% sequence similarity; and 
10 b) , detecting resulting hybrid nucleic acids in the hybridization; wherein, if 

hybrid nucleic acids axe detected at a significant level compared to a 
suitable control hybridization, then, a nucleic acid molecule comprising at 
least about 90% sequence similarity to SEQ ID NO:56 has been detected. 



1 87. A method for identifying (1) nucleic acid molecules in fixed cells which 
1 5 specifically interact with a (2) nucleic acid molecule having the nucleotide 

sequence in SEQ ID NO:56, said method, comprising the steps of: 

a) adding to the fixed cells the (2) nucleic acid molecule comprising the 
nucleotide sequence hi SEQ ID NO:56; 

b) incubating the fixed cells under conditions allowing hybridization of (1) 
2 0 with (2); 

c) removing the nucleic acid molecule of step a) that has not hybridized; 
and 

d) detecting hybrid molecules comprising (1) and (2). 
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188. A method for detecting FATP6 in a sample of cells, comprising the steps of 
adding an agent that specifically binds to FATP6 to the sample, and detecting 
agent specifically bound to the FATP6. 

1 89. The method of Claim 1 88 wherein the agent is an antibody which binds to 
5 FATP6. 

1 90. A method for detecting FATP6 in a sample of cell lysate, comprising the steps 
of adding an agent that specifically binds to FATP6 to the sample, and detecting 
agent specifically bound to the FATP6. 

191. The method of Claim 190 wherein the agent is an antibody which binds to 
10 FATP6. 

1 92. An isolated antibody which binds to a polypeptide having an amino acid 
sequence sharing at least about 95% amino acid sequence similarity with the 
amino acid sequence in SEQ ID NO:57. 

1 93. An isolated antibody which binds to a fatty acid transport protein having the 
15 amino acid sequence in SEQ ID NO:57. 

1 94. A method for modulating fatty acid uptake of cells in culture, comprising adding 
one or more agents that modulate fatty acid uptake to cells comprising one or 
more fatty acid transport proteins. 



195. 

20 



The method of Claim 194 wherein the agent modulates fatty acid uptake by 
modulating biosynthesis of one or more fatty acid transport proteins. 
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196. The method of Claim 195 wherein the agent modulates fatty acid uptake by 
modulating biosynthesis of FATP6. 

197. The method of Claim 196 wherein the agent is an antisense oligonucleotide. 

198. A method for inhibiting fatty acid uptake in the small intestine of a mammal, 

5 comprising administering to the mammal a therapeutically effective amount of 

an agent which is an inhibitor of fatty acid uptake by a fatty acid transport 
protein in the small intestine of the mammal. 

1 99. The method of Claim 1 98 wherein the agent is administered orally. 

200. The method of Claim 198 wherein the fatty acid transport protein is hsFATP6. 

10 20 1 . A method for inhibiting fatty acid uptake in cardiac muscle of a human 

comprising administering to the human a therapeutically effective amount of an 
agent which is an inhibitor of fatty acid uptake by FATP6. 

202. A method for directing an agent to cardiac muscle in a mammal, comprising 
administering to the mammal a complex which comprises the substance and a 
1 5 moiety which binds to FATP6. 



203, 



A method for directing an agent to liver in a mammal, comprising administering 
to the mammal a complex which comprises the substance and a moiety which 
binds to FATP5. 
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204. A method for detecting a variant allele of a human FATP gene, comprising: 

a) preparing amplified, purified reference DNA encoding all or a portion of 
a FATP from a human, and amplified, purified test DNA encoding all or 
a portion of the FATP from a human to be tested as having a variant 

5 allele; 

b) determining whether the reference DNA and test DNA differ in DNA 
sequence; 

wherein, if the test DNA differs in sequence from the reference DNA, the test 
DNA comprises a variant allele of a human FATP gene. 
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■scFATP 



mmFATPI 
rnFATPI 
•hsFATP4 

•mmFATP4 
■ceFATPa 
— ceFATPb 



4~ 



dmFATP 
hsFATP2 
mmFATP2 
rnVLACS 
hsFATP6 



— frFATP 

■hsFATPI 



•hsFATP3 
— mmFATP3 
— mtFATP 



•hsFATPS 
— mmFATP5 
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mmFATP3 DNA sequence 

Qcgz&g^^ SO 

CmjCI ^^^ 20Q 



PCT/US99/00182 



WO 99/36537 

7/U7 

^r«rvrmri ggy 



S^SSo^^ 760 

TIQ3::]CT3 ^^ 1680 

Jiacid;33 ^^ 1760 

Kig^icaTE^^ 2040 r- ^ 

2087 
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mUEMiEG protein sequence 
nmEaiP4 EE®, sequence 

CEMGC; ^^ 200 

Og^^Qgai^ 240 

CfiGIcaDa ^ 320 

jmzE£ ^^ 360 

T B^g 3 iG30GC&^^ 400 

©CiEGCJmXr^^ 480 

TC IQ 0332IMCro^ 640 

Qg^^jg^^T^^ 720 
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as^soayrrosiQGSio^^ 96 ? n 

^j^g^n^ iooo 

cRMaaom^ 1040 

^£300^303303^^ 1200 

iansiGnsosnm^i^ 1240 

g 3CIGC 3]3riCX3i&^^ 1280 

^^rroc^^^ 1320 

aaaaaaacEicr^^ 3360 

ooGGsasnnsc^^ 1400 

^ggCTjjoacx^^ 1440 

G3ai330CnJ ^^ 1520 

■Earnnrcm^^ 1560 

Gsmr a SfflCir ^^ isoo 

^OgjgCTgTC!^ 1680 

cjocegctoxos^^ 1720 

0332££|333IT]XJ^ 1800 

QGSKEaQCDSSGaCSQS^ 1840 

.^^^^^^ 1880 

^^qqjjjjC^^ 1320 

^oaGcisMOSia^^ 2000 

c^as&ao^ -2040 

c^grcsoaaa^ 2080 

Q XTlGllLUllUlLia^ 2120 

TCCGroSCTCCTGOLa 2150 

^j3IGE33CICaQ333aSA^^ 2240 

j^aAAAAMAaJ^AAAAAA 2301 ^ ^ IQ^> 
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protein sequence 
^^^^S^K^^ITOM, 320 

jasagxavMOT^ 507 ^ lCVJLML I 

• nrtS2fflP5 EN& Sequence 

240 

^03^333^^ 560 
aaGGGKEQDOC^^ 720 



WO 99/36537 

n^aGcnma^^ 1000 

^Saaaan^^ 1040 

Sg^^^ 1080 

SSazooaG^^ 1280 

^S^ga^a^^^ 1400 

QTracnaQQaotf^^ 1520 

^ewccx^tcci^^ 1600 

1540 

..^CTICITCEC^^ 1720 

angsnasMossaa^^ + 750 

j ^^ mGIC] ^^^ 1800 

TowraSIGim^ 1840 

asnggoGcmra^^ 1880 

0330^00?^^ 1920 

a::raro03 oDC^^ 3360 

g^jsaxzoftzoazEO^^ 2000 

GIQ T 3 IGB£32ITI ! ^^ 2040 

TCI3 y 3ffliCl ^ 2080 

gbsggcsgki^^ 2120 

^jjj^^j^^^ 2160 

•^^soasirn^^ 220a 

QnDaSiTOG0 ^^ 2240 

cn^nsiGEfls^^ 2277 



WO 99/36537 

niriB3aP5 protein sequence 

j^gjigjljGELQ:^^ 8 ° 

or«rwT™rM*ss»SH^^ 360 



hsEKCE2 22SR sequence 

jeq^sogsaj^^ 200 

g^CXQ^OS^I^^ 280 

TO3IGa[2iCI ^^ 320 
^^^^^^S^Sl^^^^^SciG 440 

QOCT^SOGSS^^ 800 
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SScwh^^ 380 




^^^^^GC^K™^ 1440 
fcsEKiE2 protein sequence 

,^r W iw^T-Tv*/Mr2inT'^^ 240 



QGBCBP 286 ^T^oJtX- ^ 



hsEKEP3 sequence 



PCT/US99/00182 



WO 99/36537 
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^^^QGASICfiC^a^^^ 280 

480 
520 
560 



^ '-^^'^g]GICr®G?^ 360 

PC£J33aXX3iQ3 400 
440 

&3£8JB3inr 480 
Ef3GIG32I3Mr 520 
X2D2IQ3C3a3 560 
2B303C3£EI5 600 



bsE3CCP3 protein sequence 



LVC 40 
hsEKEP4 ED®, sequence 



PCT/US99/00182 

WO 99/36537 - 
^.——^^ 560 

QCrnaCIC&GSrc 734 W^UJ^ 0 




fcsPHIP5 EHA seguence 

O^yjQ^Qgggc^^ 360 
0333330003*332^^ 

Q: j igia 3 gKy333 ^^ 600 



PCTAJS99/00182 



WO 99/36537 



16/U7 



1120 



hsE3ffiP5 protein sequence 
ijsE&EPS ENA. sequence 

( ^ 303aMr: ^^ 280 

GKD3rEffiI a a ^^ ..680 



WO 99/36537 
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<33MGC!MOka3»ca^^ 880 

Ca OCaQaa ^lUmtjAg^^ 1000 

AaAciraaasmrcrc^^ 1040 

CnaGSftaGaGTC^ 1080 

■ KK3333 ^ G3 ^^ 1120 

■riTAj^ri^TiTJ^TKS^^ 12QQ 

LmTim uimiuHg^^ 1240 

triu^igjaBcrafl^^ i28~o 



H&xrrxcns&n^ 23^0 



protein sequence 

^■^■^-^ 150 
H35aGPmi ^^ 200 

H0SEIELYEM2CKKS2VI^^ 335 



rasioaiaz^^ 40 

Caa£Aft£^ lg 0 

Aro3 G:a ^^^a ^ 280 

miaai ^^ 320 




A 1361 




^Xc^VJLA-fi. "3-3 



ifltEKGP sequence. 
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CPOS*^^"*^^ ^^^nr^gjjjj 640 



cracS^ 720 

SSSS^^ 760 

SSxa^^ 880 

SSS^^ 920 

^20 

TO33CIQ 3&^^ 1520 

oggocagga^ 1680 

Gonjfflscam^^ 1720 

jdaaas&Eac^^ 1880 

TicAftamsimr^^ 2000 ' 

Q33TIBG2007 ^"v^OXJ- 
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ratPSTP protein sequence 

xnscfeygg^ttvrM^ 40 

aipnskasigta^c^^ 80 

taniyaavlaazgvg^^ 12 o 

gaiagmlnyh^ 160 

cgas^gprragcfr^^ 200 

tafyiftsgttgiz^^ 240 

sdtlysclp^^ 280 

f^5es7±ariratafvy±ge± 320 

cgnglxped^^ -fn 350 

glllsrvnrlqpfdg^^ 440 

S^rnsp^rrgmafv^ 430 

' dqt^escti^ygvgip 520 

ri =^Y^P9]^ 560 

gadie^lyvlagpdegyvp^ 597 
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baFATPl 

1 teg m ue gge gee egg gae eee ui gca gaa gec cge aca gca ggc aca gcg cac cea 
61 aga agg gtc cib gag tee gca pw aca gaa agg eec eeg gee eca gec tee tag tee ceg 
121 cet gee tec tgc ctg age tec tgg gag act gaa gge a eg get egc age ttc agg atg egg 

K It 

IfiL gee ceg ggt gcg ggc gcg gec teg geg gee teg ctg gcg ctg ctg egg ctg ctg egg ctg 

A P Q A 0 AAG VV S LA L L tt L L 0 I* 
241 eeg tgg arc tgg age gcg gca geg gcg etc ggc geg cae geg gge age ggc gge egg cge 

PWTWSAAAALGyYVGSGGWA 
101 ttc etg ego ate gec tgc aag ace geg tgg eg* gac etc ttc ggc ccc tct gtg ctg ate 

FIRIVCKTARROLFQLSVLI 
Ml ego geg cge etg gag eeg egg egg eac cag egt gec ggc cae ace ate eeg cge ate tct 

RVRL-'BLRHKORAGHTl P R I F 
421 cag gcg gta geg cag cga cag cce gag cge ctg geg eeg gtg gat gee ggg aee gge gag 

OAVVQROPERLALV, DAOTOt 
481 tgc egg ace ttt geg cag ctg gac gee tae eec aae geg gta gee aae etc ccc cge cag 

CWTFAQLBAYSNAVAKL FRO 
341 ctg gge etc gcg ceg gge gac gcg gcg gec ate tee ctg gag ggc egg ceg gag cte gtg. 

LGPAPODVVAXFLBGRPE FV 

Ml ggg ceg tgg etg ggc ctg gec aag gcg ggc atg gag gec geg ceg etc aac geg aae ceg 

OLWLOLAKAGHSAA X. LHV HL 
661 egg cge gag cce ctg gee tec tgc eeg ggc aee ceg ggc get «ag gee etg ate tee gga 

R It £ P L A F C LO T SO A K ALI P 0 
721 gga gaa atg gtg gcg gcg gtg gec gaa gcg age ggg cae ctg ggg aaa age ctg ate aae 

0EHVAAVAEV3GHL OKSLX.R 
TBI etc tgc tct gga gac teg ggg ccc gag ggc ace ttg eeg gae ace cac cte etg gac ceg 

p .CSQDliOPCaiLPDTHLLDP 
8tl ctg etg ass gag gec tee aee gee eec ttg gca cag ate ccc age aag gge atg gae gat 

LLKEASTAPLAQIPSXGMDD 
901 cgt etc etc coc ate eac acg eeg ggg ace acc ggg ctg ccc aag gee gee ate gtc gcg 

R L F Y J Y ? S OTT CL P K A A X V V 
951 cae age agg tae cac cge acg gca gee tee gge cac eac gec cac cge acg cag gcg gec 

H8RYYRMAAFGHH .AY RHQAA 
102 1 gae geg etc tae gac cge ceg ccc ctg tac eac ceg gca gga aac ate ate ggc gcg ggg 

0 V LY DC L P L YH SA O HI 2 G V 0 
1081 cag tgt ccc aee cac ggg ctg aca gec gee cte cge aag aaa ccc teg gee age cge ccc 

OCL1YOLTVVLRK XFSASRF 
1141 egg gac gae ego acc aag cac aac tgc acg geg get cag tac ate ggg gag acc tgc cge 

W.DDCXKYNCTVVQYI OEICR 
1201 cac ceg ceg aag cag ceg gcg cge gag gcg gag agg cga cac cge gcg egc etg gcg geg 

YLLKQPVREAEHRHRVRLAV 
1361 gg; aac ggg ceg cue ccc gec acc egg gag gag tec acg gag cge ccc ggc gca cge caa 

C M 0 L R P A I H EE.FTE RF 0 V R 0 
1321 aee ggg gag etc cac gge gec acc gag cge aac cge age acc gec aac acg gae ggc aag 

10 EFYGATECNCS TANKDG X 
1381 gtc ggc c=e tjc ggc tec aac age cge ace ctg cce cac geg eac ccc acc egg ctg gcg 

VOSCCFNSBILPHVYPIRLV 
1441 aag gec aae gag gac aca acg gag ceg ceg egg gac gec cag ggc etc tgc ate ccc tgc 

KVMEDTME LLRDAQGLCIPC 
1501 cag gee ggg gag ccc ggc ccc etc gcg ggc cag ace aac caa cag gae ceg ceg cge egc 

QAGEPGLLVGOXNQQDPLRR 
- 1561 etc gae ggc cae gtc age gag age gee acc age aag aag ate gec cac age gec ttc age 

FDQYV5ES ATS R X I AHSVFS 
1621 aag ggc gac age gec cac etc tea ggc gac gcg eta gtg atg gac. gag ceg ggc tae atg 

XO O3AYLS0DVLVKDELOYM 

1GB1 tae tec egg gac egc age ggg gac acc tec egc egg cga ggg gag aac gec ccc acc acc 

- Y F ROR SCO T FR WRG EHVSTT 
1741 gag geg gag ggc gcg ceg age egc ceg ceg ggc cag aca gac gcg gec gee eat ggg gcg 

EV E G V L S R LLOOTDVAVYOV 
1801 get gec eca gga gcg gag ggc aag gca ggg acg gcg gec gec gca gac ccc eac age ceg 

AVP0VEGKAQHAAVADPH5L 
1861 ceg gac ece aac gcg aca tae cag gag ceg cag aag gcg etg gca ccc eat gec egg ccc 

LDPHAIYOBLQKVLAPYARP 
1921 acc tec eeg egc ccc ceg ccc cag gcg gac acc aca gge acc ece aag acc cag aag acg 

IFLRLLP0VDTT0TFKIOKT 
19B1 agg ceg cag ega gag ggc ttc gae eca egc cag acc tea gac egg etc tec ccc ceg gac 

RL0RE6FDPRQTSDALFFLD 
2041 ctg aag cag ggc cac cac ceg ccc tea aae gag gca gec eac acc cge aee egc teg gge 

LKOdHYLPLHEAVYTRXCSG 
2101 gee etc gec etc caa age egt eec tee aee gge cae aaa eec egg gee egg egg gag agg 

A r A L * 

2161 eca get ega gec aga cag egc cge cea ggg gtg gee gee tag tae aca ccc ace tgg ceg 
2221 age egx acc egg cac ggc eca cce egg act gag aaa ceg gaa cce cag agg aae ceg tgc 
2381 cce ecu gee gec teg gtg ccc ceg cge ceg ccc cce etc ece gec etc eag ect ceg ccc 
2341 eec ece ate ect gee eee gee egg ect eaa eec tee ccc etc ccc cce tee ttt ece eec 
24Q1 tte ttt ttt etc aag aca gag ect cae tee get gee egg get aga gtg cag egg egg gae 
2461 cce ggc tea ctg caa ect etg ect ccc ggg get caa geg ate cte eca eee eag ccc ece 
2521 gag cag ceg gga eta cag gca cce gee acc acg ccc age caa ctt eta eae tee tag tag 
2581 aga egg gge etc acc atg ttg gec agg ceg gtc eeg aac ece tga ect cag gcg ate egc 
2641 tgg cce egg ccc eee aga geg eeg gga eca tag gcg tga gec tec ggc ceg gec tec cce 
2701 ttt tec tee cce ccc ceg ceg aga geg gaa eac acg tgt ccc ggg age egc ate ceg tgc 
2761 agg gec cag ceg cte ctg ggg aee gca gga* acc aee cce cce ggg cce egg ace egg acc 
2821 ggg gee tec eca ccc cce tct egg ceg tge etc acg gag cce eaa ece agg eee ece geg 
2881 get gee ggg etc cag acg ctg cag cte eac gcg act eee aag cag gee etc cge cce cce 
2911 cge ega atg gag gag ceg ggg gee eee cag gec aae egg aaa ate cce cag get agg cea 
3001 act gec ttc tge act tee ceg tte etg tea cac eee ccc age ccc acc tte ccc tec tga 
3061 tge ect gaa age tte egg aat cga ceg ega eea ccc gga tgt eae cac tge cag ccc ctg 
3121 ect tga cgt eec eae tea gee ate tee atg gag etc ceg ctg gag ggc ect gaa ece egc 
3181 aet gcg tgg ctg ceo age cag ceg cce ccc gec ceg gga gga ggc ccc ctg ggc gee cce 
3241 ate egg tge gee tac egg agg gtc eca cag gag agg eag cag agg ggc eag ggg agg eee 
3301 ect gee ggg got egg ect cte aag ece cag ggg eee cag ccc gtt gaa tae aee cea cce 
3361 gge ggg tgg eee etc cga cgt eec cac tga tgg cte ega eae cgt get gge ggc gae gtc 
3421 eea gac aac ccc ace agg acg gee cag aca tec cea ctg get teg eeg gcg get cat cte 
3481 gaa cat eca cge eag ccc tte tgg ggc egg cea cce agg eeg ect gee cgt ceg tee ccc 
3541 cte eag eag cac cce etg gee eee gga geg gcg ggg eca egg caa gag aca ceg tgg egc 
3601 eee atg tga act tte eeg gge act gtg gee eta cce ccc aat ega ece aag aaa eaa acc 
3661 tga aga ceg tee ggt gaa aaa aaa aaa aaa aaa agg gcg gec ge 
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haFATM 

1 cga ecc acg ege eeg gge ggg egg gse en geg gcg ggc gyg set wo ggg gcg gec ggg 

61 eca toe agg gcg cag age egg eta mac cct get gag acc egg etc cgt gcg tec agg ggc 
121 ggc taa tgc ccc tea cge tgt eta egc tgc tge aac egg gee gca tet gga egg ggc gec 
181 gcg egg egg age cga egc egg gec aca atg ctg ctt gga gec tct ccg gtg ggg gtg ctg 

HLLQASLVGVL. 
241 ctg tte ccc tag ctg gtg ecg aaa ctg ccc tgg acc cag gcg gga tte tec ctg ttg ttc 

LFSXLVLXL pWTQVGP.SLliP 
301 etc tac ttg gga tec ggc ggc tgg egc ttc acc egg gee ttc ate aag acc acc agg egc 

LTfcflSGGWRPIRVP I X T I R R 
351 gat ate ctt ggc ggc ctg gtc etc ctg aag gtg aag gca aag gtg cga cag tgc ctg cag 

OXPGGLVLLJCVKAKVRGCLO 
431 gag egg egg aea gtg ccc att ttg ttt gee tct acc gtt egg cge eoe ccc gac aag acg 

KRRTVPILPASTV RRBPDXT 
481 gec ctg acc ttc gag ggc aea gat acc eac tgg ace ttc cge cag ctg gat gag tac tea 

A L I fEQTDTHWT FRQLDEYS 
541 age agt gta gec aac ttc ctg cag gee egg ggc ctg gec teg ggc gat gtg get gee ate 

SSVANPLQARGLASGDVAAI 
601 tte atg gag aac egc aat gag ttc gtg ggc eta tgg ctg ggc atg gec aag ccc ggt gtg 

FHEMRNEFVGLWL GHAKLGV 
661 gag gca gee cce ate aac acc aac ctg egg egg gat get ctg etc eac tge cte acc ace 

EAA LIHTNLRR DA DLHCL T .T 
721 ccg cge gca egg gee ctt gtc ttt ggc age gaa atg gee tea gee ate tgt gag gtc eat 

SRARALVFGSEHASAICEVH 
791 gee age ecg gae ccc teg cte age etc ttc tgc tct ggc tec tgg gag ccc ggc gcg gtg 

ASLDPSLSLPCSGSMEPOAV 
841 cct cea age aca gaa eac ctg gac cct ctg ccg aaa gat get ccc aag cae ctt ccc agt 

PPSTEHLDPLLKDAPKHLPS 
SOI tgc cct gac aag ggc tte aea gat aaa ctg tte tac ate cae aea cce ggc ace aca ggg 

CPDKOFTDXL FY tYTSGTTG 
961 ctg cee aag gee gec ate gtg gtg eac age agg tat tac egc atg get gee ctg gtg tac 

LPXAAIVVHSRYYRHAALVY 
1021 tat gga tte cge atg egg ccc aac gac ate gtc tat gac tgc etc cee etc tac cae tea 

YOFRHRPMDIVYDCLPLYHS 
1081 gca gga aac ate gtg gga ate ggc cag tgc ctg ccg cat ggc atg acg gcg gtg act egg 

XGHIVGIGQCULHGHTVV I R 
1141 aag aag etc tea gec tee egg ttc tgg gac gat tgt ate aag tac aac tge acg att gtg 

XKFSASRFWDDC1KYHCTIV 
1201 cag tac att ggt gaa ctg tgc egc tac cte etg aac cag cca ccg egg gag gca gaa aac 

QYI OELCRYLLHOPPREAEH 
1261 cag cae cag gtt egc atg gca eta ggc aat ggc cte egg cag tec ate tgg acc aac ttt 

Q HQ V *- M A L O K G L R O S 1 WT N F 
1321 tee age egc ttc cae ata ccc cag gtg get gag ttc tac ggg gec aca gag tgc aac tgt 

S SR FH 1 P ' Q V A EF Y GATECNC 
1381 age etg ggc aac tte gac age cag gcg ggg gee tgc ggt tte aat age egc ate etg tee 
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2101 gec tae age egc ate cag gca ggc gag gag aag ctg tga tte cee cca tec ete tga ggg 

AYS RIQAGEEXL • 

2161 ccg gcg gat get gga tec gga gec eca ggt tec gee cea gag egg tee tgg aca agg cca 

2221 gae eaa age aag cag gge ecg gca cct cca tec tga ggt get gec cct cea tec aaa act 

22 Bl gee aag tga cte att gee ttc eca acc ctt cea gag get tte tgt gaa agt etc atg tec 

2341 aag tte egt cct ctg ggc tgg gca ggc cct etg gtt cce agg ctg aga ctg acg ggt ttt 

2401 etc agg atg atg tet egg gtg agg gta ggg aga gga caa ggg gte ace gag ccc ttc eca 

2461 gag age agg gag ctt ata aat gga aee aga gca gaa gte ccc aga etc acg aag cca aca 

2521 gag tgg gca ggg aca gtg gta gca tec ate tgg tgg cca aag aga ate gta gec cca gag 

23B1 ctg ecc aag tte act ggg etc eac ccc cae ete cag gag ggg agg aga gga ect gac aee 

2641 tgt agg tgg ccc etg atg ccc cat eta cag cag gag gte agg ace acg cee ctg gee tet 

2701 cee cac tec ccc ate ccc cee ect ggg tgg ctg cct gat tat ccc tea ggc agg gee tct 

2761 cag tec teg tgg gte tgt gtc acc ecc ate tea gtc ttg gec tgg eta tga ggg gag gag 

2821 gaa egg gag agg ggg ete agg ggc caa taa act etg cct tga gte etc cca aaa aaa aaa 

26B1 aaa aaa aaa aaa aaa aaa aaa aaa aaa ggg egg ccg c 
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0 gin 
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T thr 


threonine 


V val 


valine 


H trp 


tryptophan 


X ukur 


unknown 


V tyr 


tyrosine 
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STOP 
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37 
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2.3 
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4. a 
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1.4 

3.7 
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3450 
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4264 
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1781 
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protein sequence 643 a. a. MtAGASt,VGVLL ... AYSRIQAGEEKL 
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alanine 

cysteine 

a spar to. c acid 

glutamic acid 

phenylalanine 

glycine 

hlscidina 

iaaleucme 

lysine 

leucine 

mechianina 

aapexagine 

proline 

glucanons 

arginina 

serine 

threonine 

valln« 

crypcophan 

unknown 

tyrosine 

STOP 



720 IB Qalton 



46 
16 
33 
33 
34 
34 
12 
30 
31 
76 
13 
21 
31 
23 
45 
35 
32 
46 
8 



n(%) 

7.2 
2.S 
5.1 
S.l 
5.3 
8,4 
1.9 
4.7 
4.8 

ii. a 

1.9 
3.3 
4.8 
3.6 
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5.4 
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MpnMnl'RMioino. ShmFATPIaoq^afien. CU^, -ftod w(.h Wetted „,»du. wight 



Pafl6 1 




£xon genanic) 



WO 99/36537 



PCT/US99/00182 



Aflgrtmant Report of Fig. 5 hmFATPIseq MagaEjgn* 
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sinp ChjawJ melhod wilh Wtlghied residua weight 1st 
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IQ51 



1901 Cj 
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hFATPlCOn.seq CRF 
mFftTPl.seQ ORF (fran genanic) 

hFMTloan.seq CRF 
mFTtfPl.seg ORP (fran genomic) 

hFATPloon.seg CRF 
offVOTl.seg CRF (fran genomic) 

hFMTPaccn.seg CRF 
ntFOTPl-seaj CRF I fran ge nan ic) 

hFATPloon.seq CRF 
xnFWrPl.&eg CRF {fxan genanic) 

0 hFATTloon.aeq CRF 
A rnFRTPl.seq CRF (fran genomic) 

hFKTPlcon.seq CRF 
mrxm.Beqr CRF {fran genanic) 

hFWPlccn.seq CRF 
mFflEPl.seq CRF (from genanic) 

hFATFlcai.seq CRF 
iBFATPl.seq CRF {fran genanic) 

hFATPlcon.seq CRF 
AfSi trfKTPl.seq CRF (fran genanic) 

G hFATFlcon.sea. CRF 
h mFKTPl.seq CRF (fran genomic J 

hFATPlcon.seq CRF 
mFATPl.seq CRF (fran genomic) 
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t, using CtusUii mathod wllh Watghled real** weight fcble. 
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Alignment Report ot FATP4 DNA ailgfimenl, using Ouslel method with Weighted residua weigh! table. 
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Altgnnwnl Renort ol UntfUed. using Clt*lal m.lhad vrfih PAM250 mittue weight WW*. 
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Alignm.* Rapon «MWW protein alignment, ualno Clus* method vAh PAM3SQ MM* wdghl tt*. 
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hiFATPfi 

1 aaa gge ai« cu 069 eu cgc uc ui toe, gag tag etc ace cac tag oca cee cag get 

si tea cac cet atg cet eeg sso ceg eat gtt geg «w aac tge gag em aea ee» act tc* 
131- cac agg aac cag eta ega "eat gae cae gaa etc aat aeg act cac cat agg gaa ttt ege 
111 ' eet cga gge eaa gas ttc gge acg agg ggt get 0*4 cee ecg ege ggt 1 tec too tgo oca 
341 gag act gea aac cue ege gee tee eag tea tea tea tee cag ect ttc ecg get ega att 
301 cag ect eu act eaa get ego OOT aaa gu cae etg ag« gga gai aag eee etg tee ccg 
361 gae eet ett etg agg gtg gag eeg gag gee eee tge eee eca gee gee eag cga ceo aag 
421 ett aat etc eag eae eae teg ggg cga eee ttt egg tgc aaa ect acg ate etg tee cee 
481 agg ate cee ccc eae eee gee teg cee egg aaa age ega eaa gaa cee eag gtg taa gee 
3(1 ceg agt age gag gae eeg egg tet ecg tgg aga gee gtg ecc gga aga gaa gga ege tgg 
«l egg ggg eeg aga ec* gag eeg tee tet gge eca gee gee cee atg ete eeg eca tgg eca 

H L 8 H I, 

661 aea gee cc* ggg gee gga atg gee gee etg cae ttc ttg eag aaa ete etg tte eet cac 
TVLOXOMVVLHFLQ X L L T F Y 

721 tee tgg gat gac tee tgg tee geg etg aag gtg gtg etc ace ata att egg etg aag aag 
rHDDFHP V L X V V t, I I X R I* R " X 

781 eat gaa aag aga ggg gag eeg seg aat geg eeg gat aaa etc ttg age eae go: aaa agm 

Y X X R Q E I VTVLOXfL SH AXR 
841 caa ect egg aaa eee tte ate ate eae gag gga gac ate tae acc tat cag gat oca gac 

OPRKPFI IYSO0IYTY0DVD 
901 aaa agg age age aga gtg gee cat gee tte ecg aac eae tee tet etg aaa aag ggg aac 

XR9SRVAHVFLHHSSLXXGD 
981 acg gtg get etg etg acg age feat gag eeg gae tee gtt eae gtg tgg ete gge etc ecc 

TVALLHS H«FOrV.HV*rOLA 
1011 aag etg gge tgc geg gtg gee tee ccc aae ace aac att ege tee aae eee etc ceg aat 

X LGCVVX FLHTHXR SMS L UN 
lOtl tge ace ege gee tgc ggg ecc aga gee eta gtg gtg gge gca gae ttg ett gga aeg gea 

C XRAC0PRALVVQA0LUOTV 
U41 - eaa gaa ate ect eca age etc tea gaa aae ace age get tgg ggg atg aaa gae tee ete 

tEICPSLSEMISVHOMKOSV 
1201 cea caa ggt gta ate tea eee aaa gaa aaa etg age acc tea ccc gat gag eee geg eca 

PQGVXSLKZXl.STSPDSP</P 
1261 ege age eae eat ate etc tea eee ete aag tet acc tgt ccc tac acc tec acc tet gga 

RSHKVVS LLKSTCLYIFTSC 
1321 aea aea ggt cea eea aaa gca get gtg acc age cag ecg cag gtt eca agg gge tec get 

TTOLPXAAVISQLQVLROSA 
1381 gtc etg tgg get tte gge ege ace get cac gac ate get cat ata ace ett ccc etg tat 

V t W A F C C T A H DIVYITLFLT 
1441 cac agt tea gca gec ate eeg gga act eet gga tgt get gag ecg ggt gee ace cgc gtg 

MSB A A I L 0 XSCCVEL' OATCV 
1501 eta aag aag aaa tte eca gea age eag tee tgg age gae tge aag aag tac gat geg ace 

LXKKPSA SQFWSOCXKYOVT 
1.561 geg ttt c«g tac ace gga gaa etc tgt cgc cas ect tge aaa caa tet aag aga gaa gga 

VFOV ICELCAYLCKQSXKtC 
1621 gaa aag gae cat aag geg cgc ttg gca act gga aae gge ata egg agt gat gta tgg aga 

fXOHRVRLAlGNOiaSDVWA 
1681 gaa tec tta gac aga cee gga aac aca aag geg tgc gaa ete cat gca gee ace gaa tea 

EFLDH PGNIXVCELYAATEa 
1741 age aca ect ete acg aae tae act ggg aga ate gga gca acc ggg aga aea aat ttg cee 

S ISFHHYTGR IQAIGRTHLP 
1801 tac aaa ete eee tee aet tee gac tta aca aag tae gac ttt cag aaa gac gaa cee acg 

YKLLSTFDtlKYOFQXOEPN 
1B61 aga aac gag eag gge tgg tge ate cat gtg aaa aaa gga gaa cee gga ett etc ace tec 

rneqowc ihvkkoepqll: S 
1931 cga gtg aat gca aaa aat ccc tte ttt gge tae get ggg ecc eat aag cac aca aaa gae 

ft V H A X MPFFOYA0PYKKTK0 
1981 aaa ttg ett tgt gae get tee aag aag gga gat gtt tae eee aac aet gga gae eca aca 

XtLCDVPXXGDVYfcMTCDLI 
2041 gte cag gat eag gac aac tec cet tat ttt egg gae ege ace gga gac act ete aga egg 

vqdqdnflyfwdrtgotfrw 

3101 aaa gga gaa aae gte gca aee ace eag gtt gee gac gee act gga acg ccg gae cee aea 

KGEMVATTEVADVlGMLOr I 
2181 cag gaa gca aac gee tac gge gtg get aea tea ggt tae gaa gga aga gea gga aeg gee 

0 E A H V yoVAXSOYEGR AGHA 
3221 tet aet ate tta aaa cea aae aca tet tea gae etg gaa aaa get tat gaa caa gtt gca 

SIXLX PHTSLDLEKVYEQVV 
2381 aca ett eta cea get cat gee cgc eea cga tee tea aga aee cag gaa aaa acg gaa gca 

TPLFATfACPRFLRXQEKHRA 
3341 aca gga aea ete aaa eta ttg aag cat cag teg geg gaa gae gga eee aat eea ecg aaa 
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egg 


gga 


3581 
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cac gea 
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act 


ata etc 
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eat gag aga 
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tte 
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2781 
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aat age gge cag 
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619 Amino Acids 



alanine 

cyscain* 

a spastic acid 

<jLacamlc acid 

phenylalanine 

glycine 

hlselolna 

laolaucine 

lysine 

laudna 

tnachiooina 

Mparflginfl 

prolinft 

gluCasiiw 

arqlnlaa 

a aria* 

valine 

tryptophan 

unknown 

eyroiina 

STOP 
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14 
34 
31 
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44 
13 

49 
7S 
11 
21 
21 
IS 
27 
40 
30 
51 
11 



2344 
1443 
3910 
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5 000 
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1781 
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6146 
8431 
1441 
2394 
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2303 
4314 

3«ai 

3031 
5052 
2046 

4339 



3.3 
2.1 
5.6 
5.7 
7.1 
.2.6 
2.5 
t.O 

B.a 
:a.i 

2.1 
3.4 
2.9 
3.3 
6.0 
£.0 
4.3 
7,3 
2.9 

6.1 



35T& 



WO 99/36537 



PCT/US99/00182 




WO 99/36537 



35/117 

MgnmMl R.port cl hFATPI.4,6 Mgnm.* urinfl CK^m-thod wHh PAM250 xddu. wrtptt Wto, 



Pag* 




Decoration 'Deration Sr, Shade (with solid bright yeUcw) reside that xn«cb the Owensus «nccly. 



hsf»TP4pep 
hsFKTP6pep 

hsEWTPlpep 
hsFATP4pep 
hsFATP6psp 

hsEAT-Mpep 
hsFKTP4pep 
hsFOTP6pep 

hsFATPlpep 
hsFftTP4pep 



36/117 



Hi 

o 

> 



© 



Q 
O 



50- 
40- 
30- 
20- 
10- 
0- 



n — 

u 
© 



. _•:■:-:•:•:■:■:•>:■ 

S:Si:S:$S: 

mii 



-:■:■:*:■; 





1 .*.*.*.*. 

•:•:•>:• 



£3 



2 



2 



PCT/US99/00182 

WO 99/36537 



37/U7 




min 



a 293 vector control: 23 pmol/(min*ri(>6 cells) 
o 293 FATP4 clone 7: 73 pmol/(min*r 106 ceils) 
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mmFATP4 DNA sequence 



ATGCTGCTTGGAGCCTCTCTGGTGGGGGCGCTACTGTTCTCCMGCTAGTC 
CTCCCCTGTTGCTCCTGTACTTGGGGTCTGGTGGCTGGCGTTTCATCC^ 
TTGGTGGCATGGTGCTCCTGAAGGTGAAGACCAAGGTGCGACGGTACCTTCAGGA^ 
GCTTCAATGGTACAGCGCCACCCGGACAAGAC&GCCCTGATTTT^ 

GGATGAGTACTCCAGTAGTGTGGCCAACTTCCTGCAGGCCCGGGGCCrGGCCTCAGGCAATGTA 

AAAACCG CAATGAGTTTGTGGGTCTGTGGCTAGGCATGGC CAAGCTGGG CGTGGAGGCGGCTCTCATCAACACCAACCTT 
AGGCGGGATGCCCTGCGCCACTGTCTTGACACCTCAAAGGCACGAGCTCTCATCTTTGG^ 

CTGTG AGATCCATGCTAGC CTGQ AGC CCACACT CAGCCTCTTCTG CTCTGGATCCTGGGAGCCCAGCACAGTGCCCGTCA 

GCAOVGAGCATCTGGACCCTCTTCTGGAAGATGCCCCGAAGCACCTGrc 

CTCTTCTACATCTACACATCGGGOICCACGGGGCTACCCAAAGCTGCCA^ 

T TC CCTGGTGTACTATGGATTCCGCATGC GGCCTG ATGACATTGTCTATGACTGC CTCCCC CTCTACCACTCAAGCAGGA 
AACATCGTGGGGATTGGCAGTGCTTACTC CACGG CATG ACTG TGGTG ATCCGGAAGAAGTTCTC AGCCTCCCGGTTCTGG 
GATGATTGTATCAAGTACAACTGCACAGTGGTACAGTACATTGG 
TGAGGCTGAGTCTCGGCACAAGGTGCGC^TGGCACTGGGCAACGGTCTCCGG^ 

GTTTC CACATCCCCCAGGTGGCTG AGTTCTATGGGGCCA CTGAATGCAACTGTAG CCTGGGCAACTTTG ACAGCCGGGTG 

GGGGCCTGTGGCTTCAATAGCCGCATCCTGTCCTTTGTGTACCCTATCCGTTTC 

ACTGATCCGGGGACCCGATGGAGTCTGCATTCCCTGTCAACa*£GTCAGC 

AGGACCCTCTGCGCCGTTTCGACGGGTACCTCAACCAGGGTGCCAACAACAAGAAGATTGCTAATGATGTCTT 
GGGGACCAAGCCTACCTCACTGGTGACGTCCTGGTGATGGATGAGCTGGGTTACCT^ 

CACGTTCCGCTGGAAAGGGGAGAATGTATCTACCACTGAGGTGGAGGGCACACTCAGCCGCCTGCrrCATATGGCAGATG 
TGGCAGTTTATGGTGTTGAGGTGCCAGGAACTGAAGGCCGAGCAGGAATGGCTGCCGTTGC^ 

GACCTGGAGAGCTTTGaVCAGACCTTGAAAAAGGAGCTGCCTCTGTATGCCCGCCCCATCTTCCTGCGCTTCTTGCCTGA 

GCTGCACAAGACAGGGACCTTCAAGTTCCAGAAGACAGAGTTGCGGAAGGAGGGCTTTGACCCATCTGTT 

CGCTGTTCTATCTGGATGCTCGGAAGGGCTGCTACGTTGCACTGGACCAGGAGGCCTATACCCGCATCCAGGCAGGCGAG 

GAG AAGCTGTG ATTTCCCC CTACATC C CTCTGAGGGC CAGAAGATG CTGGATTCAGAG CC CTAGCGTCCACCCCAGAGGG 

TCCTGGGCAATGCCAGACCAAAGCTAGCAGGGCCCGCACCTCCGCCCCTAGGTGCTGATCTCCCCTCTCCCAAACTGCCA 

AGTGACTCACTGCCGCTTCCCCGACCCT-CAGAGGCTTTCTGTGAAAGTCTCATCCAAGCTGTGTCTTCTGGTCCAGGCG 

TGGCCCCTGGCCCCAGGGTTTCTGATAGGCTCCTTTAGGATGGTATCTTGGGTCCAGCGGGCCAGGGTGTGGGAGAGGAG 

TC^CTAAG ATCCCTC CAATCAG AAGGGAGC TT ACAAAGG AAC CAAGGC AAAGCCTGTAG ACTCAGGAAG CTAAGTGGCCA 

GAGAGTATAGTGGGCAGTC^TCCCATGTCCACAGAGGATCTTGGTCCAGAGCTGCCAAAGTGTCACCTCTCCCTGCCTGC 



ACCTCTGGGGAAAAGAGGACAGCATCTGGCCACTGGGCACCTGTCTCAAGAAGTCAGGATCACACACTCAGTCCTTGTTT 
CTCCAGGTTCCCTTGTTCTTGTCTCGGGGAGGGAGGGACGAGTGTCCTGTCTGTCCTTCCTGCCTGTCTGTGAGTCTGTG 
TTGCTTCTCPVTCTGTCCTAGCCTGAGTGTGGGTGGAACAGGCATGAGGAGAGTGTGGCTCAGGGGCCAATAAACTCTGC 
CTTGACTCCTC^AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAW 



othFATE* protein sequence 

MLLGASIiVGALIJSKLVLKLPWTQVGFSLlJJjyLGSGGOT^IRVFIK^ 
EKL 
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hsFATPI full lenght.DNA 



10 20 30 40 

» * t 1 ) I t t I I II t , \ , t I t 1 til!) I t t t I t t I t I . , . « I 

TCGACCCACGGCGTCCGGGACCCCAAAGCAGAAGCCCGCA 40 
C AG TAGGC AC AGCGC AC CCAAG A AGGGTCCAGG AG TCTGC 80 
AGAAACAGAAAGGTCCCCGGCCTCAGCCTCCTAGTCCCTG 120 
CC 7GCC 7CC7GCC7GAGC 7 7C 7 GGG AG AC 7G A AGGCACGG 160 
C77GCAGC77CAGGA7GCGGGC7CCGGG7GCGGGCGCGGC 200 

210 220 230 240 
i i i < t i ■ ■ ■ 1 ■ > * t f i . i < i t » i i t I i i i -i ! .i t < . r i . ■ ■ f 

C7CGG7GG7C7CGC7GGCGC7G77G7GGC7GC7GGGGC7G 240 
CCG7GGACC7GGAGCGCGGCAGCGGCGC7CGGCG7G7ACG 280 
7GGGCAGCGGCGGC7GGCGC77CC7GCGCA7CG7C7GCAA 320 
GACCGCGAGGCGAGACC7C77CGG7C7C7C7G7GC7GA7C 360 
CGCG7GCGCC7GGAGC7GCGGCGGCACCAGCG7GCCGGCC 400 

410 420 430 440 

* ■ * * i ' ■ ■ ' 1 ' ■ 1 ■ 1 ' ■ ■ ■ * * * * ■ f ' ' ' ' I ■ ■ ■ » 1 ■ ■ ■ ■ * 
ACACCA7CCCGCGCA7C777CAGGCGG7AG7GCAGCGACA 440 
GCCCGAGCGCC7GGCGC-7GG7GGA7GCCGGGACCGGCGAG 480. 
TGCTGGACCTTTGCGCAGCTGGACGCCTACTCCAATGCGG 520 
7AGCCAACC7C77CCGCCAGC7GGGC77CGCGCCGGGCGA 560 
CG7GG7GGCCA7C 77CC7GGAGGGCCGGCCGGAGT7CG7G 600 

610 620 630 640 

■ « ■ * t * ■ ■ ■ t ■ ■ ■ ■ ' ■ * > ■ I * * * > 1 1 ■ 1 * * ' > ■ ■ * ■ ■ » ■ i 
GGGC7G7GGC7GGGCC7GGCCAAGGCGGGCA7GGAGGCCG 640 
CGC7GC7CAACG7GAACC7GCGGCGCGAGCCCC7GGCC77 680 
C 7GC C 7GGGC ACC 7CGGGC GC 7 A AGGCCC7GA7C 7 7 7GG A 720 
GGAGAAA7GG7GGCGGCGG7GGCCGAAG7GAGCGGGCA7C 760 
7GGGG A AAAG 777G A 7C A AG 7 7C 7 GC7C7G GAG AC 7 7GGG 800 

8 10 820 830 840 

■ ' 1 1 t 1 1 1 * 1 ■ ■ ' ■ * I t t I I 1 I I t I 1 1 t t 1 1 I | I I ! | I ! t 

GCCCGAGGGCA7CT7GCCGGACACCCACCTCCTGGACCCG 840 
CTGCTGAAGGAGGCCTCTACTGCCCCCTTGGCACAGATCC 880 
CC AGC AAGGGC A 7GG AC G A 7C G7C 777 7C 7 AC A 7C7AC AC 920 
G7CGGGGACCACCGGGC7GCCCAAGGC7GCCA77G7CG7G 960 
CACAGCAGGTACTACCGCATGGCAGCCTTCGGCCACCACG 1000 
1010 1020 1030 1040 

' ' ' ' * ' ' ' * J ' ' ' ' F ' ■ * ' ' 1 ' ' ' ' 1 ' ' * i 1 ' ' * ' I ' ' » ' > 

CC7ACCGCA7GCAGGCGGC7GACG7GC7C7A7GAC7GCC7 1040 
GCCCC7G7ACCAC7CGGCAGGAAACA7CA7CGGCG7GGGG 1080 
CAG7G7C7CA7C7A7GGGC7GACAG7CG7CC7CCGCAAGA 1120 
AA77C7CGGCCAGCCGC77C7GGGACGAC7GCA7CAAG7A 1*160 
CAAC7GCACGG7GG77CAG7ACA7CGGGGAGA7C7GCCGC 1200 



F.-a. WA 
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hsFATPI full fenght.DNA 

1210 1220 1230 1240 

....>.>«' I « * * « I * ' t t i iii t ! ii i i I tit i 1 t > i > 1 

TACCTGCTGAAGCAGCCGGTGCGCGAGGCGGAGAGGCGAC 1240 
ACCGCGTGCGCCTGGCGGTGGGGAACGGGCTGCGTCCTGC 1280 
CATCTGGGAGGAGTTCACGGAGCGCrrCGGCGrACGCCAA 1320 
ATCGGG GAGT TC TA CG GC GC C AC CGAGTGC A AC TGCAGC A 1360 
TTGCCAACATGGACGGCAAGGTCGGCTCCTGTGGTTTCAA 1400 

1410 1420 1430 1440 

.... I . . I . . . « ? t t . » I t t i i I ■'-»-» « 1 r , , , t , . , , t 

CAGCCGCATCCTGCCCCACGTGTACCCCATCCGGCTGGTG 1440 
AAGGTCAATGAGGACACAATGGAGCTGCTGCGGGATGCCC 1460 
AGGGCCTCTGCATCCCCTGCCA6GCCGGGGAGCCTGG.CCT 1520 
CC TTGTGGGTC AG A TCAACCAAC AGG ACCCGCTGCGCCGC 1560 
TTCGATGGCTATGTCAGCGAGAGCGCCACCAGCAAGAAGA 1600 

1610 1620 1630 .1640 

.... i .... i .... i ., i , l i .... I , ... i .... I 

TC GC CC AC AGCGTCTTCAGC A AGGGCG AC AGCGCCTACCT 1640 
C TC AGG TG ACGTGC T AGTGA TGG ATG AGCTGGGCT AC A TG 1680 
TACTTCCGGGACCGTAGCGGGGACACCTTCCGCTGGCGAG 1720 
GG G AG AACGTCTCC AC CACCG AGG TGG AGGGCGTGCTGAG 1 760 
CCGCCTGCTGGGCCAGACAGACGTGGCCGTCTATGGGGTG 1800 

1810 1820 1830 . 1840 

f 1 111 I,,'' 1 1 J < -> 1 » t I I ,t ■ t. 1 1 T t I I lilt t-I.J- 1.1 J -I. 1 t I. I 

GCTGTTCCAGGAGTGGAGGGTAAGGCAGGGATGGCGGCCG 1840 
TCGCAGACCCCCACAGCCTGCTGGACCCCAACGCGATATA 1880 
CCAGGAGCTGCAGAAGGTGC TGGCACCCTATGCCCGGCCC 1920 
A TC T TCC TGCGCC TC C TGCCCC AGG TGG AC ACC AC AGG'C A 1960 
CCTTCAAGATCCAGAAGACGAGGCTGCAGCGAGAGGGC TT 2000 

2010 2020 2030 2040 

t t i . \ . i . f \ t . i i I « i i t 1 i i i t I i i t i 1 t t i t ! t i t , 1 . 

TGACCCACGCCAGACCTCAGACCGGCTC TTCTTCCTGGAC 2040 
CTGAAGCAGGGCCACTACCTGCCCTTAAATGAGGCAGTCT 2080 
ACACTCGCATCTGCTCGGGCGCCTTCGCCCTCTGAAGCTG 2120 
TTCC TC TAC TGGC C AC AAACTC TGG GCC TGG TGGG AG AGG 2160 
CCAGCTTGAGCCAGACAGCGCTGCCCAGGGGTGGCCGCCT 2200 

2210 2220 2230 2240 

■ ,i i t t i i , i, I, i i t i 1 I r ■ i t t i . . i ! .i i . i i I . t i l t * t . i I , , ,. ., , 

AGTACACACCC ACC TGGC CGAGCTGT ACC TGGC ACGGCCC 2240 
ATCCTGGACTGAGAAACTGGAACCTCAGAGGAACCCGTGC 2280 
CTCTCTGC7GCCTTGGTGCCCCTGTGTCTGCCTCCTCTCC 2320 
CTGCTTTTCAGCCTCTGTC TCC TTCC ATCCCTGTCCCTGT 2360 
CTGGCCTTAACTCTTCCCTCTCTTTCTTTTCTTTCTTTCT 2400 

2410 2420 2430 2440 

i 1 1 i i. i > 1 1 f i i i i i i i 1 1 I i t i i f it i i 1 1 1 1 t r 1 1 l i J 

TTCTTTTTTTTTAAGATAGAGTCTCACTCTGCTGCCCGGG 2440 
CTAGAGTGCAGTGGTGGGATCTCGGC TCAC TGCAACCTC T 2480 
GCCTCC.TGGGGTTCAAGTGATCC TCCCACCTCAGCCTCCT 2520 
GAG TAGCTGGGAT TAC AGGC ACCCGCC ACC ACG TCC AGCT 2560' 
AATTT'TTATATTTTTAGTAGAGACGGGGTTTCACCATGTT 2600 
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hsFATPI full lengftt.DNA 

2610 2620 . 2630 2640 

■ ■ ■ 1 ( * ■ * ■ 1 « ' * ■ ' » ■ * ' l ■ < ■ ■ f * ' ■ ' > » ■ * ■ * ■ ■ f 

GGTCAGGCTGGTCTTGAACTCCTGACCT'CAGGTGATCCGC 2640 
TGGCC TCGGCC TCC C AG AG TGCTGGG ATT A TAGGCGTG AG 2680 
CCTCTGGCCCGGCCTTTCCTTTTTCCTCTCCTCTCCTGCC 2720 
GAGAGTGGAACACACGTGTCCTGGGAGCTGCATCTTGTGT 2760 
AGGGTCCAGCTGCTTTTGGGGACTGCAGGAATCATCTCCC 2800 

2810 2820 2830 2840 

* ■ ■ « 1 ■ « * ■ 1 « 1 ■ 1 1 ■ * ■ * l ' 1 1 * i * * ■ ■ I ■ ' ■ 1 * * ■ ■ ' 1 
CTGGGCCCTGGACTCGGACTGGGGCCTCCCCACCTCCCTC 2840 
TCGGCTGTGCCTTACGGAGCCCCAATCCAGGCCTCCTGTG 2880 
GCTGTTGGGTTCCAGATGCTGCAGCTCCATGTGACTTCCA 2920 
AGC AGGCC C T CC GC C C TC C CTGCTG A A TGG AG G AG CCGGG 2960 
GGTCCCCCAGGCCAACTGGAAAATCTCCCAGGCTAGGCCA 3000 

3010 3020 3030 3040 

* * * ' * ' ■ * * * ' * ' * ] * ' ' * ) » » * 1 f 1 * ' f ■ 1 ' 1 * * ' » < * ■ * 

AT TGCC TTTTGC AC TTCCCC G T TCCTGTC AC A TTTCCCC A -3040 
GCCCCACCTTCCCCTCCTGATGCCCTGAAAGCTTCCGGAA 3080 
TTGACTGTGACCACTTGGATGTCACCACTGTCAGCCCCTG 3 120 
CCTTGATGTCCCC A TTT AGC CATC TCC ATSGAGCTCCTGC 3160 
TGGAGGGCCCTGAACCCTGCACTGCGTGGCTGCCCAGCCA 3200 

3210 3220 3230 3240. 

* « ■ ■ ' « ■ « * * * « ■ ■ * ' ■ ■ 1 * 1 * 1 ■ 1 ■ ■ 1 1 * ' ■ 1 ■ f » ' * ■ r 

GC TGCC TCCTGTC CTGGGAGGAGGCCTCCTGGGTG TCC TC 3240 
ATCTGGTGTGTCTACTGGAGGGTCCCACAGGAGAGGCAGC 3280 
AGAGGGGTCAGGGGAGGTCTCCTGCCGGGGGTTGGCCTCT 3320 
CAAGCCTCAGGGGTTCTAGCCTGTTGAATArACCCCACCT 3360 
GGTGGGTGGCCCCTCCGATGTCCCCACTGATGGCTCTGAC 3400 

3410 3420 3430 3440 

,) i i t I, i i i i I i i i t f r r i i f i i i i I i i t i I t i i i f i i i t.J 

ACCGTGTTGGTGGCGATGTCCCAGACAATCCCACCAGGAC 3440 
GGCCCAGACATCCCTACTGGCTTCGCTGGTGGCTCATCTC 3480 
GAACATCCACGCCAGCCTTTCT.GGGGCCGGCCACCCAGGC 3520 
CGCCTGTCCGTCTGTCCTCCCTCCAGCAGCACCCCCTGGC 3560 
CCCTGGAGTGGTGGGGCCATGGCAAGAGACACCGTGGCGT 3600 

3610 3620 3630 3640 

I 1 1 if I . 1 t I ( , t I I 1 I T 1 , 1 t i t 1 I .1 1 I, I t 1 1 I 1 I 1 1 . 1 I t T I 

CTCATGTGAACTTTCCTGGGCACTGTGGTTTTATTTCCTA 3640 
ATTGATTTAAGAAATAAACCTGAAGACCGTCTGGTGAAAA 3680 
AAA AAA AAA AAA A A 3694 
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hsFATPI full ienqht.protein 



.10 20 * 30 40 

■ 1 ■ ■ 1 1 1 ■ ' 1 ■ ' ' ' 1 1 * ■ 1 * 1 * « * 1 1 * 1 ■ * 1 1 * 1 1 ■ * ■ ' \ 
MRAPGAGAASVVSLAILWLLGLPWTWSAAAALGVYVGSGG 40 
WRFIRI VCKTARRDLFGLSVL IRVRLELRRHGRAGHTI PR 80 
IFQAVVQRQPERLALVOAGTGECWTFAQLDAYSNAVANLF 120 
RQLGFAPGOVVA IFLEGRPEFVGLWLGLAKAGMEAALLNV 160 
NLRREPLAFCLGTSGAKAL I FGGEMVAAVAEVSGHLGKSL 200 

210 220 230 240 

i i, t t 1 t. t i t 1 \ t t t 1 t t i ,., I, J f, r M I i i r r I i t i f I t \ r f.l 

I KFCSGDLGPEG f LPDTHLLDPLLKEASTAPLAGIPSKGM 240 
DDRLFY I YTSGTTGLPKAA I VVHSRYYRMAAFGHHAYRflQ 280 
AAOVLYDCLPLYHSAGN I I GVGQCL I YGLTVVLRKKFSAS 320 
RFWDOC IKYNCTVVQY I GE I CRYLLKGPVREAERRHRVRL' 360 
A VGNGLRPA 1 WEEFTERFGVRQ 1 GEFYGATECNCS 1 ANMO 400 

410 420 430 440 

■ * « « i i * .« ' I ■ i « * f ■ i » * l ■ •« ■ « > * * * t I i i t i i « * < . f 

GKVGSCGFNSR I LPHVYP I RLVKVNEDTMELLRDAGGLC I 440 
PCGAGEPGLLVGQ I NGGDPLRRFDGYVSESATSKK I AHSV 480 
FSKGDSAYLSGDVLVMOELGYMYFRDRSGDTFRWRGENVS 520 
TTEVEGVLSRLLGQTDVAVYGVAVPGVEGKAGMAAVADPH 560 
SLLDPNA I YGELOK VLAPYARP I FLRLLPQVDTTGTFK I Q 600 

610 620 630 640 

t i », i I i t t ,| < t t ■ i f i i ■ t t i i i . I. t i i t T . » t t i I i . . i I 

KTRLQREGFOPRQTSDRLFFLDLKQGHYLPLNEAVYTRIC 640 
SGAFAL. 647 
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hsVLACS full lenght.DNA 



10 20 30 40 
f * * 1 1 ■ ■ * ■ I * ■ ■ 1 1 ' * 1 * ' * * * > 1 ■ 1 1 » * * ' 1 ■ 1 * ■ 1 ' 1 



GGAATTCCAAAAAAAAAAAATACGACTACACCTGCTCCGG 40 • 
AGCCCGCGGCGGTACCTGCAGCGGAGGAGCTCTGTCTTCC 80 
CCTTCATCTCACGCGAGCCCGGCGTCCCGCCGCGTGCGCC 120 
CCGGCGCAGCCCGCCAGTCCGCCCGGAGCCCGC'CCAGTCG 160 
CCGCGCTGCACGCCCGGGGTGAACCGTCTGCCCTCGCTGG 200 

210, 220 230 240 

, f 1 I 1 I 1 t t I I T 1 t I 1 I 1 f I I 1 I t I I I 1 1 I I 1 1 1 1 I 1 I I I I 



GACAGAGGGCCCCGCAGCCGTCATGCTTTCCGCCATCTAC 240 
ACAGTCCTGGCGGGACTGCTGTTCCTGCCGCTCCTGGTGA 280 
ACCTCTGCTGCCCATACTTCTTCCAGGACATAGGC1ACT1 320 
CTTGAAGGTGGCCGCCGTGGGCGGGAGGGTGCGCAGCTAC 360 
GGGCAGCGGCGGCCGGCGCGCACCATCCTGCGGGCGTTCC 400 

410 420 430 440 

_1. I 1 1.1 I T 1 1 I I I t I I t I 1 1 I I 1 1 I I 1 I 1 1 I tilt! I I .1 I I 



TGGAGAAAGCGCGCCAGACGCCACACAAGCCTTTTCTGCT 440 
CTTCCGCGACGAGACTCTCACCTACGCGCAGGTGGACCGG 480 
CGCAGCAATCAAGTGGCCCGGGCGCTGCACGACCACCTCG 520 
GCCTGCGCCAGGGAGACTGCGTGGCGCTCCTTATGGGT-AA 560 
CGAGCCGGCCTACGTGTGGCTGTGGCTGGGGCTGGTGAAG 600 

610 620 630 640 

i i t i I i i i t f i i i i 1 i i i i ( i t t t f i i i i f i i i i f i i i i I 



CTGGGCTGTGCCATGGCGTGCCTCAATTACAACATCCGCG 640 
CGAAG7CCCTGCTGCAC7GCTTCCAGTGCTGCGGGGCGAA 680 
GGTGCTGCTGGTGTCGCCAGAACTACAAGCAGCTGTCGAA 720 
GAGATACTGCCAAGCCTTAAAAAAGATGATGTGTCCATCT 760 
ATTATGTGAGCAGAACTTCTAACACAGATGGGATTGACTC 800 

810 820 830 840 

i i t t, I i t I i I i i > t f t i \ i f i i t i I i i t r 1 i i i, i I. i i i i f 



TTTCCTGGACAAAGTGGATGAAGTATCAACTGAACCTATC 840 
CCAGAGTCATGGAGGTCTGAAGTCACTTTTTCCACTCCTG '880 
CCTTATACATTTATACTTCTGGAACCAC AGGTCTTCCAAA 920 
AGCAGCCATGATCACTCATC AGCGCATATGGTATGGAACT 960 
GGCCTCACTTTTG TAAGCGGATTGAAGGCAGATGATGTCA 1000 

1010 1020 1030 1040 

i t i t f i i 1 i t .1 t i i f r t .i i f i f f f I t I t i I f t ( f 1 r i, t i J 



TCTATATC AC TC TGC C CT TT T ACC AC AG TGCTGC AC TACT 1040 
GATTGGCATTCACGGATGTATTGTGGCTGGTGCTACTCTT 1080 
GCCTTGCGGACTAAATTTTCAGCCAGCCAGTTTTGGGATG 1 120 
ACTGCAGAAAATACAACGTCACTGTCATTCAGTATATCGG 1 160 
TGAACTGCrrCGGTAfTTATGCAACrCACCACAGAAACCA 1200 
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hsVLACS full lenght.DNA 

1210 1220 1230 1240 

■ * ■ ■ 1 ■ ■ ■ * * ■ *■ < t t i t i i f i i > t i i i t t I t , . , t > . i , f 

AATGACCGTGATCATAAAGTGAGACTGGCACTGGGAAATG 1240 
GCTTACGAGGAGATGTGTGGAGACAATTTGTCAAGAGATT 1280 
TGGGGACATATGCATCTATGAGTTCTATGCTGCCACTGAA 1320 
GGCAATATTGGATTTATGAATTATGCGAGAAAAGiTGGTG 1360 
CTGTTGGAAGAGTAAACTACCTACAGAAAAAAATCATAAC 1400 

1410 .1420 1430 1440 

■ i i t t * « i » l ■ < « * i i i i i t i i i i i i i i i t i i ■ i i » i ■ ._i 

TTATGACCTGATTAAATATGATGTGGAGAAAGATGAACCT 1440 
GTCCGAGATGAAAATGGATATTGCGTCAGAGTTCCCAAAG 1480 
GTGAAGTTGGACTTCTGGTTTGCAAAATCACACAACTTAC 1520 
ACC A T TT AA TGGC T A TGC TG G AGC A A AGGCTC AG AC AG AG 1560 
AAGAAAAAACTGAGAGATGTCTTTAAGAAAGGAGACC7CT 1600 

1610 1620 .1630 1640 

■i t i t ) « t t i J « i f i i » i « i I i t i i i i t i f 1 « i t t'i i i t « I 
A TTTC AA C AG TGG AG A TC TC TT A A TGGTTG ACC A TG A AAA 1640 
TTTCATCTATTTCCACGACAGAGTTGGAGATACA77CCGG. 1680 
TGGAAAGGGGAAAATGTGGCCACCACTGAAGTTGCTGATA 1720 
CAGTTGGACTGGTTGATJTTGTCCAAGAAGTAAATGTTTA. 1760 
TGG AG TGC ATGTGCC AG A TC A TGAGGGTCGC AT TGGC A TG 1800 

18)0 1820 1830 1840 

* ■ ' ' 1 * V * ' t ' 1 ' * 1 I * 1 * I .LL-L-L L 1 _t_t _l i I t » ■ 1 t T f ; I . 

GCCTCCATCAAAATGAAAGAAAACCATGAATTTGATGGAA 1840 
AGAAACTCTTTCAGCACATTGCTGATTACCTACCTAGTTA 1880 
TGCAAGGCCCCGGTTTCTAAGAATACAGGACACCATTGAG 1920 
A TC ACTGGAAC TTT TA A AC AC CGC AAA A TGACCC TGG TGG I960 
AGG AGGGC TTT A AC C C TGC T G TC A TC A A AG ATGCCTTGT A 2000 

2010 2020 2030 ^ 2040 

TTTCTTGGATGACACAGCAAAAATG TATGTGCCTATGACT 2040 
GAGGACATCTATAATGCCATAAGTGCTAAAACCCTGAAAC 2080 
TCTGAATATTCCCAGGAGGATAACTCAACATTTCCAGAAA 2120 
GAAACTGAATGGACAGCCAC.TTGATATAATCCAACTTTAA 2 1 60 
TTTGATTGAAGATTGTGAGGAAATTTTGTAGGAAATTTGC 2200 

2210 2220 2230 2240 

j ... 1 .... ' i i i I i 1 t i 1 i i t r f i i '» > i ■ < i ■ I 

ATACCCGTAAAGGGAGACTTTTTTAAA-TAACAGTTGAGTC 2240 
TTTGCAAGTAAAAAGATTTAGAGATTATTATTTTTCAGTG 2280 
TGCACCTACrGTTTGTAT'TTGCAAACTGAGCTTGTTGGAG'2320 
GGAAGGCATTATTTTTTAAAATACTTAGTAAATTAAATGA 2360 
AC 2362 
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hsVLACS full ienght.protein 



10 

I 1 1 1 1 t I M I.I .111. 



20 30 40 

i f i i r.t I i f f . I i i t i f i i i i L 



MLSAl YTVLAGLLFLPLLVNLCCPYFFQDI GYFLKVAAVG 40 
RRVRS YGQRRPART I LRAFLEKAROTPHKPFLLFROETIT 80 
YAGVDRRSNGVARALHDHLGLRQGDCVALLMGNEPAYVWL 120 
WLGLVKLGCAMACLNYN I RAKSLLHCFQCCGAKVLLVSPE 160 
LQAAVEE I LPSIKKDDVS 1 YYVSRTSNTDG I DSFLDKVOE 200 

210 220 230 240 



VSTEP I PESWRSEVTFSTPALY I YTSGTTGLPKAAM I THG 240 
R 1 WYGTGLTFVSGLK ADD V i Y I TLPFYHSAALL I G I HGC I 280 
VAGATLALRTKFSASQFWDOCRKYNVTV IQY IGELLRYLC 320 
NSPQKPNDRDHKVRLALGNGLRGO VWRQFVKRFGDIC I YE 360 
FYAATEGN I GFMNYARK VGAVGRVNYLQKK I I TYDL I K YD 400 

410 420 430 440 



VEKDEPVRDENGYCVRVPKGEVGLLVCKI TQLTPFNGYAG 440 
AKAQTEKKKLRDVFKKGDLYFNSGDLLMVDHENF I YFHDR 480 
VGDTFRWKGENVATTEVADTVGLVDFVQEVNVYGVHVPDH 520 
EGR IGMAS iKMKENHEFDGKKLFQHI ADYLPSYARPRFLR 560 
I QOT I E I TGTFKHRKMTL VEEGFNPA V I KDAL YFLDDTAK 600 

610 620 630 640 




1. 1. 1 i 



jJL 





t i I t i i t I i i i i 1 i i.i i I i t i i I 
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hsFATP3 partial, DNA 



10 20 30 40 
■ ■ ■ > i i . . i t . » i ■ i i i t « I i iti f 1 1 i i l t t ( f f ■ i , , i 

AAGTTCTCGGCTGGTCAGTTCTGGGAAGATTGCCAGCAGC 40 
ACAGGGTGACGGTGTTCCAGTACATTGGGGAGCTGTGCCG 80 
ATACCTTGTCAACCAGCCCCCGAGCAAGGCAGAACGTGGC 120 
CATAAGGTCCGGCTGGCAGTGGGCAGCGGGCTGCGCCCAG 160 
ATACCTGGGAGCGTTTTGTGCGGCGCTTCGGGCCCCTGCA 200 

210 220 230 240 

i t i i t i t i t f t t i t ! t t r t I f t t i 1 r i f . t I f i i i I i 1 i , i I 

GG TG C TG GAG AC AT A TGG AC TG AC AGAGGGCAACGTGGCC 240 
ACCATCAACTACACAGGACAGCGG'GGCGCTGTGGGGCGTG 280 
CTTCCTGGCTTTACAAGCATATCTTCCCCTTCTCCTTGAT 320 
TCGCTATGATGTCACC ACAGGAGAGCCAATTCGGGACCCC 360 
CAGGGGCACTGTATGGCCACATCTCCAGGTGAGCCAGGGC 400 
410 420 430 440 

» 1 1 ' I ' * 1 1 1 i i « » 1 i i t i I i i t i I i r t i 1 i t i < i i i i i I 

TGCTGGTGGCCCCGGTAAGCCAGCAGTCCCCATTCCTGGG 440 
C TA TGC TGGCGGGCC AG AGCTGGCCC A GGGGAAGTTGCTA 480 
AAGGATGTCTTCCGGCC TGGGGATGTTTTCTTCAACACTG 520 
GGGACCTGCTGGTCTGCGATGACCAAGGTTTTCTCCGCTT 560 
CCATGATCGTACTGGAGACACCTTCAGGTGGAAGG.GGGAG 600 

610 -620 630 640 
1 1 1 1 f 1 1 1 i ? 1 1 1 1 f 1 1 i i I t i 1 1 J 1 1 i t 1 i 1 1 1 f 1 1 1 t f 

AATGTGGCCACAACCGAGGTGGCAGAGGTCTTCGAGGCCC 640 
TAGA TTTTC T TC AGG AGG TG AAC G TC TA TGGAG TC AC TGT 680 
GCCAGGGCATGAAGGCAGGGCTGGAATGGCAGCCC TAGTT 720 
CTGCGTCCCCCCCACGCTTTGGACCTTATGCAGCTCTACA 760 
CCCACGTGTCTGAGAACTTGCCACCTTATGCCCGGCCCCG 800 

610 820 830 840 

' * ■ ■ i ■ * ' 1 I * * * * i i i t i .1 t i t i t t t t i t t i i t i i i < i l 

AT TC C TC AGGC TCC AGG AG TCTTTGGCC AC CACAG AG ACC 840 
TTCAAACAGCAGAAAGTTCGGATGGCAAATGAGGGCTTCG 880 
ACCCCAGCACCCTGTCTGACCCACTGTACGTTCTGGACCA 920 
GGC TG T AGG TGCC T AC C TGC CCC TC AC AAC TGCCCGGT AC 960 
AGCGCCCTCCTGGCAGGAAACCTTCGAATCTGAGAACTTC 1000 

1010 1020 .1030 1040 

* « ■ ■ ' » « ' » I » ■ * * 1 » 1 ■ » f 1 * * ' * ■ « 1 ' f * ■ * » 1 * * * ■ l 

CACACCTGAGGCACCTGAGAGAGGAACTCTGTGGGGTGGG 1040 

GGCCGTTGCAGGTGTACTGGGCTGTCAGGGATCTTTTCTA 1080 

TACCAGAACTGCGGTCACTATTTTGTAATAAATGTGGCTG I 120 

GAGCTGATCCAGCTGTCTCTGACAAAAAAAAAAAAAAAAA 1 160 
.AAAGGGCGGCCGC 1173 
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hsFATP3oartiai.protein 



10 20 30 40 

■ ■ * » 1 ■ » > > i » ■ ' * 1 > ■ ' » I * ■ « ' 1 * * ' ' i * ■ ■ ■ 1 ■ i * * t 
KFSAGGFWEOCQQHRVTVFGY I GELCRYL VNQPPSKAERG 40 
HKVRLAVGSGLRPDTWERFVRRFGPLOVLETYGLTEGNVA 80 
T JNYTGQRGAVGRASWLYKHI FPFSL IRYDVTTGEP IRDP 120 
QGHCMATSPGEPGLLVAPVSGQSPFLGYAGGPELAQGKLL 160 
KDVFRPGOVFFNTGDLLVCDDQGFLRFHDRTGDTFRWKGE 200 

210 220 230 240 
» » « < i «i* ii tttti i i > i f ti«ti ittii i i i i i ''ill 

NVATTEVAEVFEALDFLGEVNVYGVTVPGHEGRAGMAALV 240 
LRPPHALOLMOLYTHVSENLPPYARPRFLRLQESLATTET 280 
FKQQKVRMANEGFDPSTLSDPLYVLDQAVGAYLPLTTARY 320 
SALLAGNLRI. 331 
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hsFATP4 full length 



10 20 30 40 

> * « « i 1 1 . « i . 1 1 i t > t * t i t i i i i i 1 1 1 1 f ? i \ i t it. i 

CGACCCACGCGTCCGGGCGGGCGGGGCCGGGCGGCGGGCG 40 
GGGCTGGCGGGGCGGCCGGGCCATGCAGGGCGCAGAGCCG 80 
GCTAAACCCTGCTGAGACCCGGCTCCGTGCGTCCAGGGGC 120 
GGCTAATGCCCCTCACGCTGTCTACGCTGCTGCAACCGGG 160 
CCGCATCTGGACGGGGCGCCGCGCGGCGGAGCCGACGCCG 200 

210 220 230 240 

,r ji.i 1, i I I i I I I I I I i i i I I i i i t t t I i I i , 1 i i I 1 i 1 t ; I 

6GCCACAATGCTGCTTGGAGCCTCTCTGGTGGGGGTGCTG 240 
CTGTTCTCCAAGCTGGTGCTGAAACTGCCCTGGACCCAGG 280 
TGGGATTCTCCCTGTTGTTCCTCTACTTGGGATCTGGCGG 320 
CTGGCGCTTCATCCGGGTCTTCATCAAGACCATCAGGCGC 360 
GATATCTTTGGCGGCCTGGTCCTCCTGAAGGTGAAGGCAA 400 

410 420 430 440 

_' l ' ' } f « * ' I . i t > I t t f > I t i t i I t r > t I i t i i ) 

AGGTGCGACAGTGCCTGCAGGAGCGGGGGACAGTGCCCAT 440 
TTTGT7TGCCTCTACCGTTCGGCGCCACCCCGACAAGACG 480 
GCCCTGATCTTCGAGGGCACAGATACCCACTGGACCTTCC 520 
GCCAGCTGGATGAGTACTCAAGCAGTGTAGCCAACTTCCT 560 
GCAGGCCCGGGGCCTGGCCTCGGGCGATGTGGCTGCCA.TC 600 

610 620 630 640 

, i i, i t . 1 t i i i I i i i t t i i , t i \ i t t t M t f t 1 t t i i I t t f \ f 

TTCATGGAGAACCGCAATGAGTTCGTGGGCCTATGGCTGG 640 
GCATGGCCAAGCTCGGTGTGGAGGCAGCCCTCATCAACAC 680 
CAACCTGCGGCGGG'ATGCTCTGCTCCACTGCCTCACCACC 720 
TCGCGCGCACGGGCCCTTGTCTTTGGCAGCGAAATGGCCT 760 
CAGCCATCTGTGAGGTCCATGCCAGCCTGGACCCCTCGCT 800 

810 820 830 840 

'.'II * ' ' « * I .... t 1 ... I I . I T t t 1 11 I > I 1 I ) . I 1 t I 

CAGCCTCTTCTGCTCTGGCTCCTGGGAGCCCGGTGCGGTG 840 
CCTCCAAGCACAGAACACCTGGACCCTCTGCTGAAAGATG 880 
CTCCCAAGCACCTTCCCAGTTGCCCTGACAAGGGCTTCAC 920 
AGATAAACTGTTCTACATCTACACATCCGGCACCACAGGG 960 
CTGCCCAAGGCCGCCATCGTGGTGCACAGCAGGTATTACC 1000 

1010 1020 1030 1040 

* ' * ■ )_ ' * * ' * * * * ' ' > * ■ » 1 ■ 1 » ■ 1 1 1 * } 1 1 » ■ ■ 1 ' ■ ' * 1 . 

GCATGGCTGCCCTGGTGTACTATGGATTCCGCATGCGGCC 1040 " 
CAACGACATCGTCTATGACTGCCTCCCCCTCTACCACTCA 1080 
GCAGGAAACATCGTGGGAATCGGCGAGTGCCTGCTGCATG 1 120 
GCATGACGGrGGTGATTCGGAAGAAGTTCTCAGCCTCCCG 1 160 
GTTCTG'GGACGATTGTATCAAGTACAACTGCACGATTGTG 1200 



fa. $0h 
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hsFATP4 full length 

1210 1220 1230 1240 

» > * * 1 * * * * 1 * > * * * * * * * * ' > ' ' } > * 1 ' * * ' * * * * * * ' * 

CAGTACATTGGTGAACTGTGCCGCTACCTCCTGAACCAGC 1240 

CACCGCGGGAGGCAGAAAACCAGCACCAGGTTCGCATGGC 1280 

ACTAGGCAATGGCCTCCGGCAGTCCATCTGGACCAACTTT 1320 

TCCAGCCGCTTCCACATACCCCAGGTGGCTGAGTTCTACG 1360 

GGGCCACAGAGTGCAACTGTAGCCTGGGCAACTTCGACAG 1400 

1410 1420 1430 1440 

t ' ' » * i ■ * i 1 « i t t t t t i t 1 i i i i f i i i i \ i > i t t i 1 t t ( 

CCAGGTGGGGGCCTGTGGTTTCAATA.GCCGCATCCTGTCC 1 440 

TTCGTGTACCCCATCCGGTTGGTACGTGTCAACGAGGACA 1480 

CCATGGAGCTGATCCGGGGGCCCGACGGCGTCTGCATTCC 1520 

CTGCCAGCCAGGTGAGCCGGGCCAGCTGGTGGGCCGCATC 1560 

ATCCAGAAAGACCCCCTGCGCCGCTTCGATGGCTACCTCA 1600 

1610 1620 1630 1640 
i ■ i * t t « i t I t i t < i ■ t i i I i i i i i i i i i.l i i i t < i i i t 1 

ACCAGGGCGCCAAC A AC A AG A AG AT TGCC A AGGATGTCTT 1640 
CAAGAAGGGGGACCAGGCCTACCTTACTGGTGATGTGC7G 1680 
G TG A TGG ACG AGC TGGGC T ACCTGT AC TTCCG AG ACCGC A 1 720 
CTGGGGACACGTTCCGCTGGAAAGGTGAGAACGTGTCCAC 1760 
CACCGAGGTGGAAGGCACACTCAGCCGCCTGCTGGACATG 1800 
1810 1820 1830 1840 

■ > i > I . » t « I > I I . I .... I t I I . 1 till I I 1 T t I II.. I 

GCTGACGTGGCCGTGTATGGTGTCGAGGTGCCAGGAACCG 1840 
AGGGCCGGGCCGGAATGGCTGCTGTGGCCAGCCCCACTGG 1880 
CAACTGTGACCTGGAGCGC TTTGCTCAGGTCTTGGAGAAG 1920 
GAACTGCCCCTGTATGCGCGCCCCATCTTCCTGCGCCTCC 1960 
TGCCTGAGCTGCAC AAAACAGGAACCTACAAGTTCCAGAA 2000 

2010 2020 2030 2040 

l « * t I f t l t- I > i > . t ..til . > I t I I I I 1 I 1 t l » t ) I . t ) 

GACAGAGCTACGGAAGGAGGGCTTTGACCCGGCTATTGTG 2040 
AAAG ACC CGC TG T TCT A TCT AG A TGCC C AG A AGGGCCGCT 2080 
ACGTCCCGCTGGACCAAGAGGCCTACAGCCGCATCCAGGC 2120 
AGGCGAGGAGAAGCTGTGATTCCCCCCATCCCTCTGAGGG 2160 
CCGGCGGATGCTGGATCCGGAGCCCCAGGTTCCGCCCCAG 2200 

2210 2220 2230 2240 

« * ' * ' ■ « * < i * * « t f * » * * 1 ' ' « * 1 l * * * t * * ' * 1 * * ' * I 
AGCGGTCCTGGACAAGGCCAGACCAAAGCAAGCAGGGCCT 2240 
GGCACCTCCATCCTGAGGTGCTGCCCCTCCATCCAAAACT 2280 
GCCAAGTGACTCATTGCCTTCCCAACCCTTCCAGAGGCTT 2320 
TC TGTGAAAGTCTC A TG TCCAAG T TCCG TC T TC TGGGC TG 2360 
GGCAGGCCCTCTGGTTCCCAGGCTGAGACTGACGGGTTTT 2400 

2410 2420 2430 2440 

1 1 1 I 1 , t 1 1 1 f I 1 I , I { I t T 1 1 1 1 1 I I I 1 1 L 1 I I 1 1 1 f 1 l t f 

CTCAGGATGATGTCTTGGGTGAGGGTAGGGAGAGGACAAG 2440 
GGGTCACCGAGCCCTTCCCAGAGAGCAGGGAGCTTATAAA 2480 
TGG AACC AGAGC AG A AG TC CC C AG AC TC AG G A AG TCAAC A 2520 
GAGTGGGCAGGGACAGTGGTAGCATCCATCTGGTGGCCAA 2560 
AGAG AATCG T AGC C CC AG AGC TGCCC AAG TTC AC TGGGC T 2600 



WO 99/36537 
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hsFATP4 full length 

2610 2620 2630 2640 

. . ■ ■ I t ■ « » I ' » * » t '. t 1 t 1 t 1 II ! I ! I . I I I ■ 1 t 1 , . , I 

CCACCCCCACCTCCAGGAGGGGAGGAGAGGACCTGACATC 2640 
TGTAGGTGGCCCCTGAT-GCCCCATCTACAGCAGGAGGTCA 2680 
GGACCACGCCCCTGGCCTCTCCCCACTCCCCCATCCTCCT 2720 
CCCTGGGTGGCTGCCTGATTATCCCTCAGGCAGGGCCTCT 2760 
CAGTCCrTGTGGGTCTGTGTCACCTCCATCTCAGTCTTGG 2800 

2810 2820 2830 2840 

_»'»'! * t * i I l l l * 1 I I > t I I 1 1 L 1 I J I 1 I 1 > I I I it,,) 

CC TGGCT A TGAGGGGAGGAGGAATGG-G AG AGGGGGCTC AG 2840 
GGGCCAATAAACTCTGCCTTGAGTCCTCCTAAAAAAAAAA 2880 
AAAAAAAAAAAAAAAAAAAAAAAAAAA 2907 



WO 99/36537 
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hsFATFM full length, protein 



10 20 30 40 

i t \ i i i f i i t i i. » i i t i r t t t f i t t t > f t t t i — 



MLLGASU'GVLLFSKLVLKLPWTGVGFSLLFLYLGSGGWR 40 
FIRVFIKTIRROI FGGLVLLKVKAKVRGCLQERRTVP ILF 80 
ASTVRRHPDKTAL I FEGTDTHWTFRQLDEYSSSVANFLQA 120 
RGLASGOVAA ! FMENRNEFVGLWLGMAKLGVEAAL ] NTNL 1 60 
RRDALLHCLTTSRARALVFGSEMASA I CEVHASLDPSLSL 200 

210 220 230 240 

i i I t I I f I 1 t 1 t t I f f \ M f 1 1 f i I ) t I i t 1 . ' ' i t I I t I r I ; 



FCSGSWEPGAVPPSTEHLDPLLKDAPKHLPSCPDKGFTDK 240 
LFY I YTSGTTGLPKAAI VVHSRYYRMAALVYYGFRMRPND 280 
I VYDCLPLYHSAGNI VG IGGCLLHGMTVV IRKKFSASRFW 320 
DOC IKYNCTI VOY IGELCRYLINOPPREAENGHGVRMALG 36,0 
NGLRQSIWTNFSSRFHIPGVAEFYGATECNCSLGNFOSGV 400 

410 420 430 440 

v i . t .i I i i i i I i i i i i i i i i I i i i i i i i i i I t i i i i , 1 i j tJ — ' 



GACGFNSR ILSFVYP I RL VRVNEDTMEL I RGPDGVC J PCO 440 
PGEPGGLVGR I 1 QKDPLRRFDGYLNOGANNKK I AKDVFKK 480 
GOQAYLTGOVLVMDELGYLYFRDRTGDTFRWKGENVSTTE 520 
VEGTLSRLLDMADVAVYGVEVPGTEGRAGMAAVASPTGNC 560 
DLERFAOVLEKELPLYARP 1 FLRLLPELHKTGTYKFGKTE 600 

610 620 630 640 

» * * ■ 1 ■ 1 * * * > ' * * ( * » ' ' * * * * ' 1 * ' ' 1 I 1 r * » 1 1 1 * * * 



LRKEGFDPA I VKDPLF YLDAGKGRY VPLDGEA YSR I GAGE 640 
EKL 643 
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>hsFATP5(partial) 

GTCGTTGGGATCCTCGGCTGCTTAGATCTCGOAGCCACCTGTGTTCTGGCCCCCAAG 
TTCTCTACTT CCTGCTTCTGGGA 

TGACTGTCGGCAGCATGGCGTGACAGTGATCCTGTATGTGGGCGAGCTCCTGCGATA 
CTTGTGTAACATTCCCCAGGAAC 

CAGAGGACCGGACACATACAGTCCGCCTGGCAATGGGCAATGGACTACGGGCTGAT 

GTGTGGGGAGACCTTCCAGCAGCG 

TTTCGGTCCTATTTCGGATCTNG<^ 

GGGGCTTTAGTTCAAATATTGTT 

GGGGGCGCTGCGGGGCCCTGGGGGCAAAGATGGAGGTTGCCTCCTCGGAATGCTGT 
CCCCCTTTGAGCTGGTGCAGTTCG 

ACATGGAGGCGQCGGAGCCTGTGAGGGACAATCAGGGCTTCTGCATCCCTGTAGGG 
CTAGGGGAGCCGGGGCTGCTGTTG 

ACCAAGGTGGTAAGCCAGCAACCCTTCGTGGGCTACCGCGGCCCCCGAGAGCTGTC 
GGAACGGAAGCTGGTGCGCAACGT 

GCGGCAATCGGGCGACGTTTACTACAACACCGGGGACGTACTGGCCATGGACCGCG 
AAGGCTTCCTCTACTTCCGCGACC 

GACTCGGGGACACCTTCCGATGGAAGGGCGAGAACGTGTCCACGCACGAGGTGGAG 
GGCGTGTTGTCGCAGGTGGACTTC 

TTGCAACAGGTTAACGTGTATGGCGTGTGCGTGCCAGGTTGTGAGGGTAAGGTGGGC 
ATGGCTGCTGTGGCATTAGCCCC 

CGGCCAGACTTTCGACGGGGAGAAGTTGTACCAGCACGTTCGCGCTTGGCTCCCTGC 
CTACGCTACCCCCCATTTCATCC 

GCATCCAGGACGCCATGGAGGTCACCAGCACGTTCAAACTGATGAAGACCCGGTTG 
GTGCGTGAGGGCTTCAATGTGGGG 

ATCGTGGTTGACCCTCTGTTTGTACTGGACAACCGGGCCCAGTCCTTCCGGCCCCTG 
ACGGCAGAAATGTACCAGGCTGT 

GTGTGAGGGAACCTGGAGGCTCTGATCACCTGGCCAACCCACTGGGGTAGGGATCA 
AAGCCAGCCACCCCCACCCCAACA 

CACTCGGTGTCCCTTTCATCCTGGGCCTGTGTGAATCCCAGCCTGGCCATACCCTCA 
ACCTCAGTGGGCTGGAAATGACA 

GTGGGCCCTGTAGCAGTGGCAGAATAAACTCAGMTGYGTTCACAGAAA 
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hsFATF Spartial.protein 



10 20 30 40 

* * * ' 1 ' * ■ * * ■ ■ ' * 1 * * * ' 1 ' ■ * * ' ■ « ■ ' f ■ ■ * ' * * ■ ■ ■ i 
VVG I tGCLDLGATCVLAPKFSTSCFWDOCRQHGVTV I LYV 40 
GELLRYLCN1 PQQPEDRTHTVRLAMGNGLRADVWGDLPAA 80 
FRSYFGSXEVLRASTEGQHGALV.QILLGALRGPGGKDGAC 120 
LLRMLSPFELVGFDMEAAEPVRDNQGFC IPVGLGEPGLLL 160 
TKVVSQQPFVGYRGPRELSERKLVRNVROSGDVYYNTGDV 200 

210 220 230 240 

* 1 1 * 1 * ■ » 1 1 ■ 1 ■ * 1 * * * ■ * ' * ■ * 1 * ' » * l > » ■ * 1 » ■ > * 1 
LAMDREGFLYFRDRLGDTFRWKGENVSTHEVEGVLSQVDF 240 
LQQVNVYGVCVPGCEGKVGMAAVALAPGGTFDGEKLYQHV 280 
RAWLPAYATPHF I R I QDAMEVTSTFKLMKTRLVREGFNVG 320 
I V VDPLFVLONRAQSFRPLTAEMYQA VCEGTWRL 354 



fu- S3 
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hsFATP6 full ienght.DNA 



10 20 30 40 

» f ' t L- 1 f I I t I I 1 t t t f I t I . 1 I I I ■ \ I . . t I 



AACGGCAAG7AAGCGCAACGCAA77AA7G7GAG7AGC7CA 40 
CTCA7TAGGCACCCCAGGC777ACAC777A7GC77CCGGG 80 
C7CG7A7G77G7G7GGAA77G7GAGCGGA7ACCAA777CA 120 
CACAGGAACCAGCTATGACATGATTACGAATTTAATACGA 160 
CTCACTAf AGGGAATTTGGCCCTCGAGGCCAAGAATTCGG 200 

210 220 230 240 

'> > < > i > i > > \ > > t , i i i i > I i i it i i 1 it I t « > t i > > i > ) 



CACGAGGGGTGCTGAGCCCCTGCGCGGTTTCTGGTGCGTA 240 
GAGAC7GfAAA7CGC7GCGC77C7CAG7CATCA7CA7CCC 280 
AGCTTTTCCCGGCTCGAATTCAGCCTCCAACTCAAGCTCG 320 
CGGGAAAGACTACCTGAGAGGAGAAAAGC7TCTGTCCCTG 360 
GACCTTCTTCTGAGGGTGGAGTCGGAGGCTCCCTGCTTTC 400 
410 420 430 440 

I < I t t I t t I I, t | | , I E ' ' ' ' F l - 1 I -I I -I t- I L . I t I I t I > ■ ^ ■ f, 



C AGCCGCC C AG 7G ACCC A AGC 77 AA7C 7 7CAGCACC AC 77 440 
GGGGCGACCTTTTCGGTGCAAACCTACGATTCTGTTTCTC 480 
AGGATTCCTCCCCATCCCGCTTCGCCCCGGAAAAGCTGAC 520 
AAGAACT7CAGGTGTAAGCCCTGAGTAGTGAGGATCTGCG 560 
GTCTCCGTGGAGAGCTGTGCCTGGAAGAGAAGGACGCTGG 600 

610 620 630 640 

* * ' 1 1 ' * * ' I 1 1 ' ' 1 * ' ' ' * * ' < f * 1 1 ' ' F ' 1 1 ' * ' ' » * 1 



TGGGGGCTGAGATCAGAGCTGTCTTCTGGCCCAGTTGCCC 640 
CCATGCTTCTGTCATGGCTAACAGTTCTAGGGGC7GGAAT 680 
GGTCGTCCTGCACTTCTTGCAGAAACTCCTGTTCCCTTAC 720 
7777GGGA7GAC77C7GG77CG7G77GAAGG7GG7GC7CA 760 
77A7AA77CGGC7GAAGAAG7A7GAAAAGAGAGGGGAGC7 800 

810 820 830 840 
i i i i 1. i .,1 i 1 i i i t I i i i i f i t ■ i I 1 i i i I i i t 1 I t , i ,,f 



GGTGACTGTGCTGGATAAATTCTTGAG TCATGCCAAAAGA 840 
CAACCTCGGAAACCTTTCATCATCTATGAGGGAGACATCT 880 
ACACCrATCAGGATGTAGACAAAAGGAGCAGCAGAGrGGC 920 
CC ATG TCT TCCTGAACCA77CCTCTCTG A AAAAGGGGG AC 960 
ACGG7GGC7C7GC7GA7GAGCAA7GAGCCGGAC77CG77C 1000 

1010 1020 1030 1040 

l, I I I f I L t. I ,f 1 1 1. 1 I I t 1 t 1 I t t I t f t I | i t I t L t t | t I i 



ACG7G7GG77CGGCC7CGCCAAGC7GGGC7GCG7GG7GGC 1040 
C777C7CAACACCAACA77CGC7CCAAC7CCC7CC7GAA7 1080 
7GCA7CCGCGCC7G 7GGGCCC AG AGCCC7AG7GGTGGGCG 1 120 
CAGA777GC77GGAACGG7AGAAGAAA7CC77CCAAGCC7 1 160 
C7CAGAAAA7A7CAGTG777GGGGGA7GAAAGA77C7G77 1200 



WO 99/36537 
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hsFATP6 full lenght.DNA 

1210 1220 1230 1240 

* ■ ■ * 1 « « * * i > ■ ■ ■ 1 * ■ ■ ■ i * * ■ < i » ■ ' * 1 1 ■ ' » i ■ ■ ■ ' i 

CCACAAGGTGTAATTTCACTCAAAGAAAAACTGAGCACCT 1240 
CACCTGATGAGCCCGTGCCACGCAGCCACCATGTTGTCrc 1280 
ACTCCTCAAGTCTACTTGTCTTTACATTTTTACCTCTGGA 1320 
ACAACAGGTCTACCAAAAGCAGCTGTGATTAGTCAGC7GC 1360 
AGGTTTTAAGGGGTTCTGCTGTCCTGTGGGCTTTTGGTTG 1400 

1410 1420 1430 1440 

i \ * i i * i t . 1 ■ ■ ■ i i > » ♦ t l t * t i i i t t i I i i i * i i ■ t i i 

TACTGCTCATGACATTGTTTATA7AACCCTTCCTC7GTAT 1440 

CAT AG TTC AGC AGC TATCC TGGG AATTTCTGG A TGTGTTG 1480 

AGTTGGGTGCCACTTGTGTGTTAAAGAAGAAATTTTCAGC 1520 

AAGCCAGTTTTGGAGTGACTGCAAGAAGTATGATGTGACT 1 560 

GTGTTTCAGTATATTGGAGAACTTTGTCGCTACCTTTGCA 1600 

1610 1620 1630 1640 

* « ■ t 1 i > i i I . i r ) \ t t i 1 I i t I 1 1 T 1 I I I I 1. < t f 1 I i i t 

AACAATCTAAGAGAGAAGGAGAAAAGGATCATAAGGTGCG 1 640 

TTTGGC AA T TGGA AATGGC A TACGG AG TGATGT A TGGAG A 1680 

GAATTTTTAGACAGATTTGGAAATATAAAGGTGTGTGAAC 1 720 

TTTATGCAGCTACCGAATCAAGCATATCTTTCATGAACTA 1 760 

CACTGGGAGAATTGG AGC AAT TGGG AG AACAAATTTGTTT 1800 

1810 1820 1830 1840 

.« ■ ■_* t * * * « I ' * ■ * ' ■ t i ■ i » 1 ■ i ..).., i f 

TACAAACTTCTTTCCACTTTTGACTTAATAAAGTATGACT 1840 
TTCAGAAAGATGAACCCATGAGAAATGAGCAGGGTTGGTG 1880 
TATTCATGTGAAAAAAGG AG AACCTGG AC TTC TC ATT TCT 1920 
CGAGTGAATGCAAAAAATCCC TTC TTTGGC TATGCTGGGC 1960 
CTTATAAGCACACAAAAGACAAATTGCTTTGTGATGTTTT 2000 

2010 2020 2030 2040 

. i m , t 1 » ii , ' ' ■ i f 1 1 1 I 1 ' r ■ \ '''->!-■*- ' — i i 1— > . i l t_ I > i. i ,i I 

TAAGAAGGGAGATGTTTACCTTAATACTGGAGACTTAATA 2040 
GTCCAGGATCAGGAC AATTTCCTTTATTTTTGGGACCGTA 2080 
CTGGAGACACTTTCAGATGGAAAGGAGAAAATGTCGCAAC 2 120 
CACTGAGGTTGCTGATGTTATTGGAATGTTGGATTTCATA 2160 
CAGGAAGCAAACGTCTATGGTGTGGCTATATCAGGTTATG 2200 
2210 2220 2230 2240 

i i i i I i t r 1 1 > i i i ) i i i i 1 t \ i i 1 i i. i i t i i i i I t i t i 1 

AAGGAAGAGCAGGAATGGCTTCTATTATTTTAAAACCAAA 2240 
TACATCTTTAGATTTGGAAAAAGTTTATGAACAAGTTGTA 2280 
ACATTTCTACCAGCTTATGCTTGTCCACGATTTTTAAGAA 2320 
TTCAGGAAAAAATGGAAGCAACAGGAACATTCAAACTATT 2360 
GAAGCATCAGTTGGTGGAAGATGGATTTAATCCACTGAAA 2400 

2410 2420 2430 2440 

1 1 1 I 1 1 1 1 1 I I t I f T t I I f I t 1 1 f I I I I I ' 1 ' ' 1 I I 1 1 f 

ATTTCTGAACCACTTTACTTCATGGATAACTTGAAAAAGT 2440 
CTTATGTTCTACTGACCAGGGAACTTTATGATCAAATAAT 2480 
GTTAGGGGAAATAAAACTTTAAGATTTTTATATCTAGAAC 2520 
TTTCATATGCTTTCTTAGGAAGAGTGAGAGGGGGGTATAT 2560 
GATTCT TTATGAAATGGGGAAAGGGAGCTAACATTAATTA 2600 



Fa. 5HB 
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hsFATPS full lenght.DNA 

2610 2620 2630 2640 

■ ■ ■ ■ 1 ' * * * * * * ' * 1 » 1 * * 1 1 1 ' * i * * > * 1 > * ■ * 1 ■ * * ■ t 
TGCATGTACTATATTTCCTTAATATGAGAGATAATTTTTT 2640 
AATTGCATAAGAATTTTAATTTCTTTTAATTGATATAAAC 2680 
ATTAGTTGATTATTGTTTTTATCTATTTGGAGATTCAGTG 2720 
CATAACTAAGTATTTT'CCTTAATACTAAAGATTTTAAATA 2760 
ATAAATAGTGGCTAGCGGTTTGGACAATCACTAAAAATGT 2800 

2810 2820 2830 2840 

■ ' * t \ . . . . t i ■ i . i i i t i l . t i t i i ■ t . 1 . . . . t . . . . i 

ACTTTCTAATAAGTAAAATTTCTAATTTTGAATAAAAGAT 2840 
TAAATTTTACTGAAAAAAAAAAAAAAAAAAAAAATTGGCG 2880 
GCCGC 2885 



Ft* &c 
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hsFATP6 full lenght.protein 



10 * 20 30 40 

1,1 l t I .1 1 1. 1 1 1 1 I 1 I j, ( ?. I 1 T 1 I I 1 T T f t 1 ( f f I ! f I t 1 1 

MLLSWLTVLGAGMV VLHFLQKLLFP YFWDOFWFVLKYVL I 40 
1 [RLKKYEKRGELVTVLOKFLSHAKRQPRKPFI IYEGOIY 80 
TYQOVDKRSSRVAHVFLNHSSLKKGDTVALLMSNEPOFVH 120 
VWFGLAKLGC VVAFLNTN I RSNSLLNC I RACGPRALV VGA 160 
DLLGTVEEILPSLSEN ISVWGMKDSVPQ^V [SLKEKLSTS 200 
210 _220 230 240 

i i i t M i i ! 1 t t i i I t t t i, I t i f i I t t f i I t t t f 1 r t r t } 

PDEPVPRSHHVVSLLKSTCLYI FTSGTTGLPKAAVISGLQ 240 
VLRGSAVLWAFGCTAHDI VY I TLPLYHSSAA I LG I SGCVE 280 
LGATCVLKKKFSASGFWSDCKKYDVTVFGY IGELCRYLCK 320 
GSKREGEKDHKVRLAIGNGIRSDVWREFLDRFGNI KVCEL 360 
YAATESS I SFMNYTGR I GA I GRTNLFYKLLSTFDL IKYDF 400 

410, . 420 430 440 

t t f f I i t, i t, } t t t i ,1 t t i r I r > i i 1 i > i i 1 i i i f I > f i i 1 

QKDEPMRNEGGWC 1 HVKKGEPGLL I SRVNAKNPFFGYAGP 440 
YKHTKDKLLCDVFKKGDV7LNTGDL I VQDQONFLYFWDRT 480 
GDTFRWKGENVATTEVADV I GMLDFI QEANVYGVAISGYE 520 
GRAGMAS I ILKPNTSLDLEKVYEQVVTFLPAYACPRFLR I 560 
QEKMEATGTFKLLKHQLVEDGFNPLK ISEPLYFMDNLKKS 600 

610 620 630 640 

■* 1 ' ■ 1 * * ■ ■ * ■ 1 * * 1 ' * ■ ■ 1 1 * 1 1 T 1 * * 1 * * * * 1 1 1 ■ * ' 1 

YVLLTRELYDQ I MLGE IKL. 620 



mFATPI full length. DNA 



60/117 



10 20 30 40 

AAGTTCCCACTCCAGACTTCTGCGAGAACCCGTGAGGAAG 40 
CAGCGAGAACCGGGGGTTTGC AAGCCAGAGAAGGATGCGG 80 
ACTCCGGGAGCAGGAACAGCCTCTGTGGCCTCATTGGGGC 120 
TGCTTTGGCrTCTGGGACTTCCGTGGACCTGGAGCGCGGC 160 
GGCGGCGTTCGGTGTGTACGTGGGTAGCGGTGGCTGGCGA 200 

210 220 230 240 

I 1, I I 1 I I 1 I I, I I 1 1 I I 1 I I 1 1 1 1 1 1 I I 1 I I t t I I 1 I I I 1 I 

TTTCTGCGTATCGTCTGCAAGACGGCGAGGCGAGACtTCT 240 
TTGGCCTCTCTGTTCTGATCCGCGTGCGGCTAGA'GCTACG 280 
ACGACACCGGCGAGCAGGAGACACGATCCCACGCATCTTC 320 
CAGGCCGTGGCCCAGCGACAGCCGGAGCGCCTGGCGCTGG 360 
TAGATGCGAGTAGCGGTATCTGCTGGACCTTCGCACAGCT ;400 

410 420 . 430 440 

* » * ' 1 ■ ■ » » F ■ 1 * ' 1 ' ' 1 * f 1 ' \ * 1 ■ 1 1 ' * 1 » 1 ■ 1 1 ' ' * T 

AGACACCTACTCCAATGCTGTGGCCAATCTGT.JCCTCCAG" 440 
CTGGGCTTTGCGCCAGGCGATGTGGTGGCTGTGrrCCrGG 480 
AAGGCCGGCCCGAGTTCGTGGGACTGTGGCTGGGCCTGGC 520 
CAAGGCCGGTGTAGTGGCTGCGCTTCTCAATGTCAACCTG 560 
AGGCGGGAGCCCCTTGCCTTCTGCfTGGGCACATCAGCTG 600 
610 620 630 640 

■J I > t f I 1 I \ f I 1. T I 1 I t I 1 f 1 t 1 t I I I 1 I \ I 1 I 1 I I 1 I I T , 

CCAAGGCCCTC ATTTATGGCGGGGAGATGGCAGCGGCGGT 640 
GGCGGAGGTGAGTGAGCAGCTGGGGAAGAGCCTGCTCAAG 680 
TTC'TGCTCTGGAGATCTGGGGCCTGAGAGCGTCtTGCCTG 720 
ACACGCAGCrTCTGGACCCCATGCTTGCTGA'GGCGCC'CAC 760 
CACACCCCTGGCACAGGCCCCAGGCAAGGGCATGGATGAT 800 

8 10 820 830 . 840 

t ■ i ■ i t i t t l . i ■ ■ i i ■ t i I t i i t i t .« t i l i t i i i « i_i < 1 

CGGCTATTTrACATCTATACTTCTGGGACCAC'cdGACTTC.840 
CTAAGGCGGCCATTGTGGTGCACAGCAGGTACTACCGCAT 880 
CGCAGCCTTCGGCCACCATTCCTACAGCATGCGGGCCAAC 920 
G A fGTGC TC TA TG AC TGC C TACCTCTCrACC AC TCAGC AG 960 
GGAACATCATGGGCGTGGGACAGTGTATCATCTACGGGTT 1000 
1010 1020 1030 1040 

I I t I [ L I I 1 I tlttT till! I I I I I 

AACGGTGGTACTGCGCAAGAAGTTCTCCGCCAGCCGCTTC 1040 
TGGGACGACTGTGTCAAATATAATTGCACGGTAGTGCAGT 1080 
ACATCGGTGAAATATGCCGCTACCTGCTAAGGCAGCCGGT 1 120 
TCGCGATGTAGAGCGGCGGCACCGCGTGCGCCTGGCCGTG. 1 160 
GGTAACGGACTGCGGCCAGCCATCTGGGAGGAGTTCACGC 1200 



WO 99/36537 
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mFATFI full length.DNA 



1210 1220 1230 1240 

. t * ■ « I ■ ■ « » 1 i i I I I ■ ■ i i I 1 l I. t . I I l l l I l i i l 1 l l l . I 

AGGGTTTCGGTGTGCGACAGATTGGCGAGTTCTACGGCGC 1240 
CACCGAATGCAACTGCAGCATTGCCAACATGGACGGCAAG 1280 
GTCGGCTCCTGCGGCTTCAACAGCC6TATCCTCACGCATG 1320 
TGTACCCCATCCGTCTGGTCAAGGTCAACGAGGACACGAT 1360 
GGAGCCACTGAGGGACTCCCAAGGCCTCTGCATCCCGTGC 1400 
1410 1420 1430 1440 

' i ' t I i « i t ) i t t . I t t t , ) i i i t } t t i . I t l i t.l t i i t I 

CAGCCCGGGGAACCTGGGCTTCTCGT.GGGCCAGATCAACC 1440 
AGCAAGACCCTCTGCGGCGCTTCGATGGCTATGrTAGTGA 1480 
CAGCGCCACCAACAAGAAGATTGCCCACAGCGTGTTCCGA 1520 
AAGGGGGACAGCGCCTACCTTTCAGGTGACGTGCTAGTGA 1560 
TGGACGAGC TGGGGTACATGTACTTCCGTGACCGCAGCGG 1600 

1610 1620 1630 1640 

_■ ■ » ■ ' ■ ■ * » f ' ■ ' ■ * 1 1 ■ ■ » * ' » ■ » 1 * * 1 * l 1 * * 1 1 ■ ■ ' * *- 

GGATACCTTCCGATGGCGCGGCGAGAACGTATCCACCACG 1640 
GAGGTGGAAGCCGTGCTGAGCCGCCTGTTGGGCCAGACGG 1680 
ACGTGGCTGTGTATGGAGTGGCTGTGCCAGGAGTGGAGGG 1720 
GAAAAGCGGCAT GGCGGCCATTGC AGACCCCCACAACCAG 1760 
CTGGACCCTAACTCAATGTAGCAGGAATTGCAGAAGGTTC 1800 

1810 1820 1830 1840 

t 1 ■ t 1 ■ 1 I » t I I I I I I I t I I t I t I ( t 1 L 1 f I 1 I I T I t 1 > I 

TTGCATCCTATGCCCAGCCCATCTTCCTGCGTCTTCTGCC 1840 
CCAAGTGGATACAACAGGCACCTTCAAGATCCAGAAGACC 1880 
CG AC T AC AGCG TG A A GGC T T TGACCCCCGCC AG AC CTC AG 1920 
ACCGGCTCTTCTTTCTAGACCTGAAACAGGGACGCTACCT 1960 
ACCCCTGGATGAGAGAGTCCATGCCCGCATCTGCGCAGGC 2000 

2010 2020 2030 2040 

■ ' ■ ■ i » ' < < i > * ■ ' i ' > * t f i i i i 1 t i i i I i t t i i i t > i l 
GACTTCTCACTCTGAGCC TGGTGAGTGGGATGGCCCTGGA 2040 
CTTGTGAGACCAGGGAGCCGGACACCCCTGTTCAGGTGTT 2080 
TCTCCTGCCTGGCCACGTGGCCAGCAGCACCTGTGGGTGC 2120 
AGGAAACTGGAACCTGAGTGGCCGGGT.GTCCCTTTCCTAC 2160 
AACCCACCATGCACACATCTAGCCTCTGCCTTGGTCTTTT 2200 

2210 2220 2230 2240 

■ ' ' ' ' ■ ' ■ » I ■ ' ■ ■ ' ' ' ■ ' I ■ ' * ' t ' t t i I i \ i t 1 i r t i 1 

TCTCCATC TCTTTCCTCCGTGCCCAGCAGGAGCCCCACAG 2240 
ACACATTGGCTGCTGTGTCCTGCAGTGGGACCGGTGTCTA 2280 
GGGGTCCATGCTGCAGGCTGTGACCCGCACTGGTGCCCAC 2320 
CTCCCTTCCCC AT TGTGCCTTAGGTTCCTCC AC TGTGCGC 2360 
CGGTGAAGCAAGTGGGGACCCACATAGCTGTTGTCCCrGC 2400 

2410 2420 2430 2440 

« i i i i ■ »» i 1 i i i i i * i i i i .i. i i i i i i i i I i i » t i i i i i I 
TGAGGGTTGGTAGCAAATGCACCCTCATGTCAGCTGGGAG 2440 
ACACATGCAGTCTCCCACTGACCCCCAATCAACTGAAGAT 2480 
ACTGTTTTGTATTATTGTTTTGAGATAGGGTCTCACTGTG 2520 
GAGGCCAAGCTGGCCTCAGGCTCACCACTCTACTGCCTCC 2560 
GGGCACCAGCCTGCAGTTTGATGACATGTATGCACTATTG 2600 
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mFATPl full length.DNA 

2610 2620 2630 2640 

■ • ■ ■ ' i .... i .... f » ... i ». i . f .... i .... i 

TTCTAAGGGTCTTCTGAGTCCCTGCTTTCCCCTCATGTCC 2640 
TAAAACCTTCCAGAACTGACTCTGATCACTTGGATGTAGC 2630 
TAGTGTTGGCCCTGCCCACGTGTGTCAATTCAGGGGTCCC 2720 
CAGGCATCATCTCTGGAGGCCCTAACCTTGGCAAAGCTTG 2760 
GATGTCCTCACATCACAGCAGGAGACCCAGGAAGGTTGCT 2800 

2810 2820 2830 2840 

■ > ■ * f » f i * 1 ' ■ * ■ 1 t * * ■ i * ' ■ * 1 1 ■ 1 * 1 ' * •» * 1 ■ * * ■ 1 

GTGGTGTCTCTTGGGCACCCCTGGCGGCAGCCGTGGACAT 2840 
GC TTCCCTGCTG TG A TAGCCCAAACTGTTGCCTATG AC AT 2880 
TTGAGGTCTACCCTTCTGGCTGCCATGGTCCCCATTGAGA 2920 
TCTTTGGTGACTCACCTCAGCCACCAAGCCAGGCCTCTGC 2960 
CTTCCTTCAGCTCTAAGGGCATGAAGGGTGTGGACAGAGC 3000 

3010 ' 3020 3030 3040 

' ' ' ' * ' « ' ' t i r . t I . . I i 1 ~t l 1 t I l I I I I > . . 1 . . ) , I 

AGCC AC AGGC TGCC C AC AG TC ACCCAC A TGC A AG TGTT AT 3040 
TTCCTTGTTTGTTTTAAAAAAATAAACATGCTGAGCCTTG 3080 
AAAAAAAAAAAAAAAAAA 3098 



WO 99/36537 PCTYUS99/001 82 



• 63/117 

mFATPI full lenght.protein 



10 20 30 40 

. ■ > t t i . i . t t < i i f i t i t f tit t i tiii f iiti i i i t i f — , 

MRTPGAGTASVASLGLLWLLGLPWTWSAAAAFGVYVGSGG 40 
WRFLR I VCKTARRDLFGLSVL I RVRLELRRHRRAGOT I PR 80 
I FQAVAGRQPERLAL VDASSG I CWTFAQLOTYSNAVANLF 120 
LQLGFAPGDVVAVFLEGRPEFVGLWLGLAKAGVVAALLNV 160 
NLRREPLAFCLGTSAAKAL I YGGEMAAAVAEVSEQLGKSL 200 

210 220 230 240 

i t i ■ i ■ t » ■ f ' ' ' * i « « ■ * f * * * > > f « f < ? j t f i i > i . « ? 

LKFCSGDLGPE'SVLPOTQLLDPMLAEAPTTPLAQAPGKGM 240 
PDRLFY I YTSGTTGLPKAA I VVHSR YYR I AAFGHHSYSMR 280 
ANDVLYDCLPLYHSAGN IMGVGQC I I YGLTVVLRKKFSAS 320 
RFWDDCVKYNCTVVQY I GE I CRYLLRQPVRDVERRHRVRL 360 
AVGNGLRPA J WEEFTQGFGVRQ I GEFYGATECNCS I ANMD 400 

410 420 430 440 

.1 t I I I 1 I 1 I 1 t I I t I I I I J I I 1 I 1 I 1 I I 1 1 1 ) I I I T I 1 I I 

GKVGSCGFNSRILTHVYP IRLVKVNEDTMEPLRDSQGLC I 440 
PCQPGEPGLLVGQ I NQQDPLRRFDGYVSDSATNKK I AHSV 480 
FRKGDSAYLSGDVLVMDELGYMYFRDRSGDTFRWRGENVS 520 
TTEVEAVLSRLLGQTDVAVYGVAVPGVEGKSGMAA I ADPH 560 
NQLDPNSMYQELQKVLASYAQP I FLRLLPQVOTTGTFK I Q 600 

610 620 630 640 

■ ■ ' ' * I I I I T t 1 I t 1 I I 1 t I It! 1 1 1. I I I 1 ( t 1 I I I 1 1 I f 

KTRLQREGFDPRQTSDRLFFLDLKQGRYLPLDERVHAR I C 640 
AGDFSL. 647 



Fi 3 . 57 
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mVLACS(FATP2)ful) length.DNA 



10 20 30 40 

GACACAGTACTGCCGATGTTGGACAGAGGATCGCTTAACA 40 
G AACG AAA TC TC A A AAC A AATTAACAGG AC CCGGT TGC TT 80 
GATTTCCCAAATCAGAAAAGGCTCGAAATGTCTAGAGGGG 120 
CTGACTGATGCAGCGGTGACCCGGACTGGAGACAGTTGGA 160 
CGCGATCATCTCTGGTGCTTTTGTTCAACCTTGAAACCTT 200 

210 220 230 240 
t » ■ ■ \ t t i . I . t t . > . t ■ . \ i t . . i f T 1 1 l >,..)(,, , f 

CGCCACAGGAGACTTGCCTGAGCAGAGAAGCAAACGTGGA 240 
GAAACAAAGAGAGATCTAGCGAAAAGCCTCTGGGACCAAG 280 
GAGGGGAGGTGGGACTCTGGGTTGGCGGTGGCACCTGCTG 320 
CCGGCTATTAATAATAGGGTCGCGATGCGTTTATAAGGTG 360 
TTTGAT T A AA C AA AG AC TC T A TG AG AG A AG AAT AAC T AG C 400 

410 420 430 440 

* * * » 1 * * * * i * » * * ' ■ * » 'i » * » * 1 1 ' » ■ J * * ■ ■ 1 * * ■ ' * 
A AC AGCCCC AC G TC TG AG TCGTCGCCTCCGACC ft TTTC A 440 
ACGTGGGTTCTTTGGGCCGAGCGTCGTTTGCCGAGAACTA 480 
GA TC TC AC CTG AC CCC AG ACGCTGAAAAC A AGCGC TGTGG 520 
CATCCTGGGCCACCCAAGCTGACAAGGGCGCGCCCCCTGA 560 
GCACACGAGGTGCCCCACGAGGGGGAGGGACCCACAGCtG 600 

610 620 630 640 

■ ■ * ■ * ' * ' ■ * ■ « ' « 1 * * * * 1 ■ ■ ■ 1 1 * * » ' I ' * ' ■ ' * ' » > * 
TCCCGCCCGCACCGCGGTGTCCGCTGCGGGCACCTGCAGC 640 
CGAGCCGCCACCCGCAGTCGCAGCGCGTCCGGCGGCCGAA 680 
CCCGGTCGTCAGC TCGTCAGCACCTGCTCTGCT.TCTCTCC 720 
CGCCCGCCGCCGCGCTGCACGCCTCGAGCGCTCCCTCGGC 760 
CCCGGCGGGGACCGGGGACCCCGCAGCCACCGCCATGCTG 800 

810 820 830 840 

. ... t .... r .... f .... f i .... r t .... i 
CCTGTGCTCTACACCGGCCTGGCGGGGCTGCTGCTGCTGC 840 
C TC TGC TGC TC AC C TGC TGC TGC CCCTACCTCCTCCAGG A 880 
CGTG CGG T TC T TC C TGC AAC TGGCC AAC A TGGCCCGGC AG 920 
GTGCGCAGCTACCGGCAGCGGCGACCCGTGCGCACCATCC 960 
TGCATGTCTTCTTGGAGCAAGCGCGCAAGACCCCGCACAA 1000 

1010 1020 1030 1040 

' ' t I > I I . 1 I I t t t t i t I r I t 1 < I 1 1 f t ( f I f f I t f I i i I t 

GCCCTTCCTGCTGTTTCGCGACGAGACGCTTACCTACGCC 1040 
CAGGTAGACCGGC GC AGC A ACCAAGTAGCGCGAGCGC TGC 1080 
ATGATCACCTGGGCCTGCGGCAGGGGGATTGCGTGGCCCT t 120 
CTTCATGGGCAATGAGCCGGCCTACGTGTGGCTCTGGCTG 1160 
GGAC TGC TC A AAC TGGGCTGTCCC A TGGCGTGCCTC AAC T 1200 



WO 99/36537 
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rnVLACS(FATP2)full length.DNA 

1210 1220 1230 1240 

■ t i » f i t t i l i i 1. 1 t * i i » l t i i i f i i t i f i i i i f i i 1 1 I - 

ACAACATCCGTGCCAAGTCTCTGCTACACTGCTTTCAGTG 1240 
CTGCGGGGCGAAGGTGCTGCTGGCCTCCCCAGAGCTACAC 1280 
GAAGCTGTCGAGGAGG TTCTTCCAACCCTGAAAAAGGAGG 1320 
GCGTGTCCGTCTTCTACGTAAGCAGAACTTCTAACACTAA J360 
TGGCGTGGACACAGTACTGGACAAAGTAGACGGGGTGTCG 1400 

1410 1420 1430 1440 
* . . . t . « « ■ I , , < . \ . t . . i « « i « i t i < t ) i . , i i t )t i I 

GCGGACCCCATCCCGGAGTCGTGGAGGTCTGAAGTCACGT 1440 
TCACCACACCCGCAGTCTACATATAfACTTCGGGCACCAC 1480 
AGGTCTTCCAAAGGCTGCAACCATTAATCACCATCGCCTC 1520 
TGGTATGGGACCAGCCTTGCCCTGAGGTCCGGAATTAAGG 1560 
CTCATGACGTCATCTACACCACCATGCCCCTGTACCACAG 1600 

1610 1620 ' 1630 1640 

■ i ■ ■ i ■ ■ ■ i l- « ■ i ■ i > i -i ■■ < « i i i i i i i i I i i i i i i i i i I 

CGCGGCGCTCATGATTGGCCTCCACGGATGCATTGTGGTT 1640 
GGGGCTACATTTGCTTTGCGGAGCAAATTTTCAGCCAGCC 1680 
AGTTTTGGGACGACTGCAGGAAATACAACGCCACTGTCAT 1720 
TCAGTACATCGGTGAACTGCTTCGGTACCTCTGCAACACG 1760 
C C CC AGAA AC C AAA TG AC CG G GAC C AC A AAGTGA A A ATAG 1800 

1810 1820 ■ 1830 1840 

■ ■ * ■ * * ...lit., t . .i if f * t . I i t 1 1 I t 1 > t t r ( I I I 

CACTAGGAAATGGCTTACGAGGAGATGTGTGGAGAGAGTT 1840 
CATCAAGAGATTTGGGGACATTCACATTTATGAGTTCTAC 1880 
GC T TCCAC TG AAGGC A AC A T TGG AT TTATG A ACT A TCCAA 1920 
GAAAAATCGGAGCTGTTGGAAGAGAAAATTACCTACAAAA 1960 
AAAAG T TG TA AGGC AC G AGC TG A TC A AG TATGACG TGG AG 2000 

2010 2020 2030 2040 

f . . . i ■ , ■ , l , , I i i i "i i t t t i I i i i i i i i i i I 

AAGGATGAGCCTGTCCGTGATGCAAATGGATATTGCATCA 2040 
AAGTCCCCAAAGGAGAGGTTGGACTCTTGATTTGCAAAAT 2080 
CACAGAGCTCACACCATTTTTTGGCTATGCTGGAGGAAAG 2120 
ACCC AGAC AG AG AAGAAA A AGC TC AG AG A TGTTTTT A AG A 2160 
AAGGAGACGTCTACTTCAACAGTGGCGATCTCCTGATGAT 2200 

2210 2220 2230 2240 

■ > ■ > f - * t t t . . . i i i i i t f t t t i I i i 1 i i i i t i f i i i t I 

CGACCGTGAAAATTTCATCTATTTTCACGACAGAGTTGGA 2240 
GACACCTTCCGGTGGAAAGGAGAGAATGTAGCTACCACGG 2280 
AAGTCGCTGACATT6TGGGACTGGTAGATTTTGTTGAAGA 2320 
AGTGAATGTTTACGGTGTGCCCGTGCCAGGTCATGAAGGT 2360 
CGCATCGGGATGGCCTCGATCAAGATGAAAGAAAACTACG 2400 

2410 2420 2430 2440 

r i f t t i f i t I i i i t 1 t t i i I i t i f I t t i i 1 i i t t f i t i i , l 

AGTTCAATGGAAAGAAACTCTTTCAGCAC ATCTCGGAGTA 2440 
CCTGCCCAGTTACTCGAGGCCTCGGTTCCTGAGAATACAA 2480 
GATACCATTGAGATCACCGGGAC TTTTAAACACCGCAAAG 2520 
TGACCCTGATGGAAGAGGGCTTTAACCCC TCAGTCATCAA 2560 
AGATACCTTGTATTTC ATGGATGACACAGAAAAAACATAC 2600 



Fia. $$B 
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mVLACS(FATP2)fuH length.DNA 

2610 .2620 .2630 2640 

■ ■ * ■ i ■ ■ ■ * t » ■ « « 1 » * * * 1 ■ t i i t i t f ■ I t i . i i ■ ■ . . t 
GTGCCCATGACTGAGGACATTTATAATGCCATAATTGATA 2640 
AGACTCTGAAGCTCTGAATGTTGCCTGGCTCCTAACACTT 2680 
CCAGAAAGAAACACAATAGGCCTAGCA7AGCCCCTTCACA 2720 
TGTGTAATCCAACTTTAACTTGATTAAAGGTTATAGGTGT 2760 
GATTTTTCCTAGGAAATTATTCATTTAAAGGACAATTGTT 2800 

2810 2820 2830 . 2840 

■ i ' i t ' ■ ' ' F t * t t f . i i i I t I I I I r i I I I f i t i t , t i . l 

TGTTTGTTTGTTTGTTTTTTATTAATTACACCAGAACGTT 2840 
TGCAAGTAAAAAGATTTAAAGTCACTTATTTTTCAATGTG 2880 
CACCTGCCATTTGTCCTTGCAAACTTAGCTTCTTGGAGAG 2920 
AGGGCCTTATTTTTTTAAAGACATAATAAACTATGTAAAC 2960 
ACT 2963 



WO 99/36537 
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mVLACS(FATP2)fu(l length.prot 



10 20 30 40 

* ■ * > 1 ■ ■ * ■ t ' ■ ■ * 1 ■ t ■ * 1 * ' > * ( » * » ■ i * « « ' * ' 1 ■ * 1 

mlpvlytglagllllpllltcccpyllqdvrfflqlanma 40 
rqvrs yrqrrpvrt ilhvfleqarktphkpfllfrdetlt 80 
yagvdrrsngvaralhdhlglrqgdc valfmgnepayvwl 120 
wlgllklgcpmaclnyn iraksllhcfqccgakvllaspe 160 
Iheaveevlptlkkegvsvfyvsrtsntngvdtvldkvdg 200 

210 220 230 240 

Ittlltllll llfllltlfl ItTll Itllflllll 

VSADP I PESWRSEVTFTTPAVY I YTSGTTGLPKAAT I NHH 240 
RLWYG TSLALRSG I K AHD V I YTT-MPL YHSAALM I GLHGC I 280 
VVGATFALRSKFSASQFWDOCRKYNATV IQY I GELLRYLC 320 
NTPQKPNORDHK VK I ALGNGLRGDVWREF I KRFGD I H I YE 360 
FYASTEGN1 GFMNYPRK I GAVGRENYLQKKV VRHEL I KYD 400 

410 420 430 440 

* t 1 « t I t I I I t t I 1 I < ■ < . t I . t 1 > I I I t I t 1 I T 1 I I I t I 

VEKDEPVRDANGYC I KVPKGE VGLL I CK ITELTPFFGYAG 440 
GKTOTEKKKLROVFKKGDVYFNSGDLLM IDRENF IYFHDR 480 
VGDTFRWKGENVATTEVAD I V GL VDF V EE VNVYG VP VPGH 520 
EGR I GMAS I KNKENYEFNGKKLFQH I SEYLPSYSRPRFLR 560 
IQDT I E I TGTFKHRKVTLMEEGFNPSV I KDTLYFMDDTEK 600 

610 620 630 640 

1 1 ' 1 1 I t i « t I i i > I t i t | I | r i ■ ! t t i i t t ) t r ( t t t r ( . 

TYVPMTEO I YNA 1 1 OKTLKL. 62 1 
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mf ATP 4 partiai.DNA 



10 20 30 40 

t_ ( ' i i t » » i i i » i i * ■ i * ■ 1 i ?■» i i i i i i I i i » i i » i > i i 

GATCAGCTCTTCTATATCTACACGTCGGGCACCACGGGGC 40 
TACCCAAAGCTGCCATTGTGGTGCACAGCAGGTATTACCG*80 
AATGGCTGCCCTGGTGTACTATGGATTCCGCATGCGGCCT 120 
G A TG AC A TTG TC T A TG AC TGCCTCCCCCTCTACCACTC AG J60 
C AGG A AAC A TTG TGGGG A T TGGCC AG TGCGT AC TCCACGG 200 

210 220 230 240 

f i tit i f .... i t 1. 1 if , i , , \ t t . . } 

CATGACTGTGGTGATCCGGAAGAAG TTTTCAGCCTCCCGG 240 
TTCTGGGATGACTGTATCAAGTAC'AACTGCACAATTGTAC 280 
AGTACATTGGTGAGCTTTGCCGCTACC TCCTGAACCAGCC 320 
ACCCCGTGAGGCTGAGTC TC GGC AC AAGGTGCGC A TGGC A 360 
CTGGGCAACGGTCT'CCGGCAGTCCATCTGGACCGACTTCT 400 

410 420 430 440 

' ■ ' ' * F i . ■ » f . . r ■ 1 ill if , ■ ♦ , [ i i , t f 

CCAGCCGTTTCCACATTCCCAAGGTGGCCGAGTTCTACGG 440 
GGCCACCGAGTGCAACTGTAGCTTGGGCAACTTTGACAGC 480 
CAGGTGGGGGCCTGTGGCTTCAATAGCCGCATCCTGTCCT 520 
TTGTGTACCCCATCCGCTTGGTACGAGTCAATGAGGATAC 560 
CATGGAACTGATCCGGGGACCCGATGGCGTCTGCATTCCC 600 

610 620 630 640 

* » ' * * ' ' * ■ i * * > * ' ■ ' » * f ■ 1 « ■ i ■ ' > * f * ■ * 1 * * ' * * i 
TGTCAACCAGGCCAGCCAGGCCAGC TGGTGGGTCGCATCA 640 
TCCAGCAGGACCCCC T ACGCCGTTT TG A TGGCT ACCTCAA 680 
CC AGGG TGCC A AC A AC A AG A AG A TTGC TAG TG A TGTCTTC 720 
AAGAAAGGGGACCAAGCCTACCTCACTGGTGACGTGCTGG 760 
TGATGGATGAGCTGGGCTACCTGTACTTCCGAGACCGCAC 800 

810 820 830 840 

■ ■ ' * i i t i i 1 . i t . i . ) t ■ 1 > . i « t i t i . I . i i ■ r , , . . I 

AGGGG AC AC G T TCC G C TGGA A AGGG GAG A A TGTGTCT AC C 840 
AC TG AAG TGG AGGGC AC AC TCAGCCGCCTGCTTC AG ATGG 880 
CAGATGTGGCTGTTTATGGTGTTGAGGTGCCAGGAGCTGA 920 
GGGCCGAGCAGGAATGGCTGCTGTGGCAAGCCCCACTAGC 960 
AACTGTGACCTGGAGAGC TTTGCACAGACCTTGAAAAAGG 1000 

1010 1020 1030 1040 

« < ■ * i * » ■ * l i > t f t < i i i I ii t i i i t i t 1 i i i i i > t i i I 

AGCTGCCCCTGTACGCCCGCCCCATCTTCCTCCGCTTCTT 1040 
GCCTGAGCTGCACAAAACAGGAACCTTCAAGTTCCAGAAG 1080 
ACAGAGTTGCGGAAGGAGGGCTTTGACCCGTCTGTTGTGA 1 120 
AAG ACCCACTC T TC T ATT TGGA TGCCCGG AC AGGCTGCT A 1 160 
TGTTGCACTGGACCAAGAGGCCTATACCCGCATCCAGGCA 1200 



WO 99/36537 



PCT7US99/00182 



69/117 

mFATP4 partial.DNA 



1210 1220 1230 1240 

I i t v f t i i i I i i i i I l i i i 1 1 t ... r , i ,t„l w i.j J i i t * I t i i i 1 ,. 

GGCGAGGAGAAGCTGTGATTTCCCCCACATCCCTCTGAGG 1240 
GCCAGAGGATGCTGGATTCAGAGCCCCAGCTTCCACTCCA 1280 
GAAGGGGTOTGGGCAAGGCCAGACCAAAGCTAGCAGGGCC 1320 
CGCACCTTCACCCTAGGTGCTGAT CCCCCT 1350 



PCT/US99/00182 



70/117 

rnFATP4partiaLDNA 



10 20 30 * 40 

■ ■ ■ * 1 ' » * » i * * * ' ' ' » * * i * » * *- 1 f * ' * i ' * * * > » * « * l 
DQLFYIYTS6TTGLPKAAI VVHSRYYRMAALVYY6FRMRP 40 
DO I V YDCLPL YHSAGN I VG i GQCVLHGMT VV I RKKFSASR 80 
FWDDC I KYNCT I VQY 1 GELCRYLLNQPPREAESRHKVRMA 120 
LGNGLRQ5 I WTDFSSRFHI PKVAEFYGATECNCSLGNFDS 160 
QYGACGFNSR I LSFVYP I RLVRVNEDTMEL I RGPOGVC [P 200 

210 220 230 240 

■ i i t t « t i . I i i i i i i i i t t i i i i 1. 1 i t t t » i t t i i . f 

CQPGQPGQLVGR I I QGDPLRRFDGYLNQGANNKK I ASDVF 240 
KKGDQAYLTGDVLVMDELGYLYFRDRTGDTFRWKGENVST 280 
TEVEGTLSRLLOMADVAVTGVEVPGAEGRAGMAAVASPTS 320 
NCDLESFAQTLKKELPL YARP I FLRFLPELHKTGTFKFQK 360 
TELRKEGFOPSVVKOPLFYLDARTGCYVALDQEAYTR I OA 400 

410 420 430 440 

■ * * * 1 * * ' * t ■ 1 ■ ■ 1 ■ * ■ * i ■ ■ ' 1 1 1 1 * 1 l 1 ■ ■ 1 * •■ » * ' I 
GEEKL. 406 
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mmFATPI full length.DNA 



10 20 30 40 

* * 1 ' ' 1 ' i ■ ■ * ■ 1 ■ » 1 * * » 1 ■ 1 * ■ ■ > 1 * » * 1 1 1 ' * » * * 
ATGCGGGCTCCTGGAGCAGGAACAGCCTCrGTGGCCTCAC 40 
TGGCGCTGCTTTGGTTTCTGGGACTTCCGTGGACCT-GGAG 80 
CGCGGCGGCGGCGTTCTGTGTGTACGTGGGTGGCGGCGGC 120 
TGGCGCTTTCTGCGTATCGTCTGCAAGACGGCGAGGCGAG 160 
ACCTCTTTGGCCTCTCTGTTCTGATTCGTGtTCGGCTAGA 200 
210 220 230 240 

*''*ll'''ltl)lfl'1llf t I t I i I I 1 I I I t 1 1 t I r I I f 

GCTGCGACGAC ACCGGCGAGCAGGAGACACGATCCCGTGC 240 
ATCTTCCAGGCTGTGGCCCGGCGACAAGCAGAGCGCCTGG 280 
CACTGGTGGACGCCAGTAGTGGTATATGCTGGACCTTCGC 320 
ACAGCTGGACACCTACTCCAATGCTGTAGCCAACCTGTTC 360 
CGCCAGCTGGGCTTTGCACCAGGCGATGTGGTGGCTGTGT 400 

410 - 420 430 440 

» t t i i « . t . t > ■ . i t * » i * t * t i t t . i t t f t i t . i t , t , \ 

TCCTGGAGGGCCGGCCGGAGTTCGTGGGACTGTGGCTGGG 440 
CCTGGCCAAGGCCGGTGTGGTGGCTGCTCTTCTCAATGTC 480 
AACCTGAGGCGGGAGCCCCTGGCC TTCTGCCTGGGC AC AT 520 
CAGCTGCCAAGGCCCTCATTTATGGCGGGGAGATGGCAGC 560 
GGCGGTGGCGGAGGTGAGCGAGCAGCTGGGGAAGAGCCTC 600 

610 620 630 640 

■ ■ * » I ■ ■ ■ < 1 1 ■ ■ ' t ' * ' " I ' » ' t I t i t i I t i t t I i ■ » i 1 
CTCAAGTTCTGCTCTGGAGATCTGGGGC-CTGAGAGCATCC 640 
TGCC TG AC AC GC AGC TCCTGG A CCCC A TGC TTGCTGAGGC 680 
GCCCACCACACCCCTGGCACAAGCCCCAGGCAAGGGCATG 720 
GATGATCGGCTGTTTTACATCf ATACTT-CTGGGACCACCG 760 
GGCTTCCTAAGGCTGCCATTGf GGTGCACAGCAGGTACTA 800 

810 820 830 840 

' ' ' ' 1 ' ' ' ' 1 < 1 ' ■ 1 '/ t i I i I i t 1 i r i t I t i t ■ I i . I 

CCGC AT TGC TGC C T TTGGC C AC C A TTCCT AC AGC A TGC GT 840 
GCCGCCGA TG TGC TCTATG AC TGCCTGCCACTCTACCACT 880 
CTGCAGGGAACATCATGGGTGTGGGGCAGTGCGTCATCTA 920 
CGGGTTGACGGTGGTACTGCGCAAGAAGTTCTCCGCCAGC 960 
CGC TTC TGGG A TG AC TG TG TC A AG T AC AAT TGC ACGG TAG 1000 

1010 1020 1030 1040 

i t ■ t i J ( i i i liii . flriirlffi i l . i l. 1 i t t i I t i f i i 

TGGATGACATAGGTGAAATCTGCCGCTACCTGCTGAGGCA 1040 
GCCGGTTCGCGACGTGGAGCAGCGACACCGCGTGCGCCTG 1080 
GCCGTGGGTAAT1GGGCTGCGGCCAGCCATCTGGGAGGAGT 1 120 
TCACGCAGCGCTTCGGTGTGCCACAGATOGGCGAGTTCTA 1 160 
CGGCGCTACCGAGTGCAACTGCAGCATTGCCAACATGGAC 1200 
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mmFATPI full length.DNA 

1210 1220 1230 1240 

■ * ■ ■ ' ■ ■ » < f * » * 1 f « * ■ ■ 1 ' ■ « 1 1 » ■ « ' f ■ » * * * ' * ■ ■ r 

GGCAAGGTCGGCTCCTGCGGCTTCAACAGCCGTATCCTCA 1240 
CGC A TG TG T ACCC C A TC CG T C TGG TC A AGG TC A A TG AGG A 1280 
CACGATGGAGCCACTGCGGGACTCCGAGGGCCTCTGCATC 1320 
CCGTGCCAGCCCGGGGAACCCGGCCTTCTCGTGGGCCAGA 1360 
TCAACCAGCAGGACCCTCTGCGGCGTTTCGATGGTTATGT 1400 

1410 1420 1430 1440 

* ■ ■ ■ 1 » > ■ ■ * ■ ■ » * i ■ ■ » ■ * 1 * » ■ ' ■ » ■ ' 1 1 ■ ■ ■ * ■ ■ ■ ■ f 

TAGTGACAGTGCCACCAACAAGAAGATTGCCCACAGCGTT J 440 
TTCCGAAAGGGCGATAGCGCCTACCTCTCAGGTGACGTGC 1 480 
TAGTGATGGACGAGCTGGGCTACATGTATTTCCGTGACCG 1 520 
CAGCGGGGACACCTTCCGCTGGCGCGGGGAGAACGTGTCC 1560 
ACCACGGAGGTGGAAGCCGTGCTGAGCCGCCTACTGGGCC 1600 

1610 1620 1630 1640 

■ ■ * * f ■ ' * * f * * * * f * ■ ■ ■ * ■ * ■ * 1 ' * ■ ■ f » ■ ■ * * * * ■ * 1 
AGACGGACGTGGCTGTGTATGGGGTGGCTGTGCCAGGAGT 1640 
GGAGGGGAAAGCTGGCATGGCAGCCATCGCAGATCCCCAC 1680 
AGCC AG T TGG AC CC T A AC TC A A TG T ACC AGG AATTAC AG A 1720 
AGGTTCTTGCATCCTATGCTCGGCCCATCTTCCTGCGTCT 1760 
TC TGC CCC AGG TGGA T ACC AC AGGC ACC TTC A AG A TCC AG 1 800 

1810 1820 1830 1840 

* t ' i * . ' " I . , m ] i , i i J » f i i 1 f i t t ! i I f t I i r > r I 

AAGACCCGGCTGCAGCGTGAAGGCTTTGACCCCCGTCAGA 1840 
CCTCAGACAGGCTCTTCTTTCTAGACCTGAAGTCCGGCAC 1880 
G AGG TA TC T ACC CC TGG A TG AG AGAG TCC A TGCCCGC ATT 1920 
TGCGCAGGCGACTTCTCACTCTGAGCCTGGAGAGTGGGCT 1960 
GGGCCTGGACTCCTGAGACCTGGGAGCCTGACACCCCTCT 2000 

2010 2020 2030 2040 

«» * i i i i * . I i i i ■ i i » i ». I i * i i i i i t i I i i * > i i i i i I 

TCGGG TGC T TC T CC TGCC TGGCC AC A TGG AC AGCAGC ACC 2040 
TGTGAGAGTAGGAAAATGGAACCTGAGTGGCTGGGACCCC 2080 
TCTCCTACTTCCCACTATGCATCCATTTTGCCTCTGCCTT 2120 
GATCTTTTTCTCCATCTCTTTTCTCCCTACCCAGCAGGAG 2160 
CCCC AC A AAC AC A TG T TGGC TG CTGTG TCC TGC AG T TGG A 2200 

2210 2220 2230 2240 

I ,1 t t f 1 t 1, 1 1 > I 1 I ! 1 ,t 1 1 \ t I 1 1 f I 1 t I f I I t T f I T | |. 1 

CCAGTGTCCAGGGGTACAGGCTTCAGGCTGTGACCCACAC 2240 
TGG TACCCACCTCCC TT TCC TATTT TGCC TT AGG TTC A TC 2280 
CACGGTTCCCCTGTGGAGCAAGTGGGGGCCCACATAGCTG 2320 
C TG TC C CTGC TG AGG G T TGG T AG CAATC AC ACC CTC A TGT 2360 
CAGCTGGGAGACACGCGCAGTCTCCCACTGACCCCCAATC 2400 

2410 2420 2430 2440 

■ i - 1 i f * i -i- * I i i «' f i * i i I > t > i t t i i i I tiitl tiiiT 

AACTGAAAATATTGTTTTGACTACTTTTTGTTTTTTTGTT 2440 
TTTTTGTTTTTTTTTTTTTTCGAGACAGAGTTTCTCTGTA 2480 
TAGCCC TGGC TG TCC TGG A AC TC AC TT TGT AG ACC AGGCT 2520 
GGCCTCGAACTCAAAAATCCTCCTGACTCTGCCTCTGCTT 2560 
CCCAAGTGCTGGGATTAAAGACGTGCGCCACCACCGCCTG 2600 



fia. a& 
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mmFATPI full length.DNA 

2610 2620 2630 2640 

t . . i I I y i f i i i f T i i i i I i i i t I i t i i I i l t-i 1 ) i . ■ [ 

GCTGTTTTGTATTTTTGTTTTGTTTTGACGATAGGGTCTC 2640 
ACTGTGGAGGCCAAGCTGGCCTCAGACTCCCCACCCCATT 2680 
GCC TC TGGGCACC ATTC TATA TTCTCAGACTGATGACAAT 2720. 
GC AC TAGTGTCCC TAGGAGTCTTGAGTC TGC AC TTTCCCC 2760 
TCATAGCCTCAAGCTTCCAGAACTGACTCTGATCACTTGG 2800 

2810 2820 2830 2840 

i i « ■ < ■ i » * * * ' ■ « < * * ■ ' 1 -» * ' > i » ' « ' I > ' ■ > ' ' * * ■ t 

ATGTGGCTAGTGTTGGCTCTACCCACATGTGTCAATTCAG 2840 
GGGTCCCCAGGCATAGTCTCTGGAAGCCCTCACCCGGAAA 2880 
AAGCTTGGAGAGACCCAGGAAGGTTGTTGTGTTCTCTTGG 2920 
GCACCCCCTGGTGGCAGTCCTGGGCATGCTTCCGCACTGT 2960 
ACTGGTGCATATAGCCCAGACCTATGACATTTGAGGTCTA 3000 

3010 3020 3030 3040 

■ f - ' ' ' ■ * ■ * ■ * ' ' 1 » ■ * ' 1 ■ ' ■ ' 1 ' ' * ' 1 ' ' ' ■ } ■ » ' » 1 : 

CCCTTCTGGCTCCTGTGGTCCCCATTGAGATCCTTGGTGA 3040 
CTC ACC TC AG TCACC A AG CAGAGCCTCTGCCTGCCTTC AT 3080 
CTTCAAGGTCATGAAGGATGTGGACAGAGCAGCTACAGGC 3120 
TGCCAGC AG TC A ACC AC A TG AG AG TGTTACTTCCTTGTTG 3160 
GT TTT TAA AA AA TAA A TG TGC TGAGCCTCGA A AAAAAAAA 3200 

3210 3220 3230 3240 

■_' t « 1 I t I 1 t 1 I 1 I I ttllt llttl ! ( t t 1 I 1 I I 1 tttlf , 

AAAAAAAAAAAAAAAAA 3217 
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mmFATPI full length.protein 



10 20 30 40 

* i ' ' « * i ' * < * i » * * * i * > * > l » » > > i ' ' * • i 



MRAPGAGTASVASLALLWFLGLPWTWSAAAAFCVYVGGGG 40 
WRFLR J VCKTARRDLFGLSVL I RVRIELRRHRRAGDT IPC 80 
I FQAVARRGPERLALVDASSG I CWTFAQLDTYSNAVANLF 120 
RQLGFAPGOVVAVFLEGRPEFVGLWLGLAKAGVVAALLNV 160 
NLRREPLAFCLGTSAAKALI YGGEMAAAVAEVSEQLGKSL 200 

210 220 230 240 
■ ■ ■ ■ i r i I ■ i i t r i > i i I i t < i i -i 



LKFCSGDLGPES ILPDTQLLDPMLAEAPTTPLAQAPGKGM 240 
DORLFYI YTSGTTGLPKAAI VVHSRYYR I AAFGHHSYSMR 280 
AADVLYOCLPLYHSAGN I MGVGQCV I YGLTVVLRKKFSAS 320 
RFWODCVKYNCTVVOO I GE I CR YLLRQPVRDVEQRHRVRL 360 
AVGNGLRPA I WEEF7QRFGVPQ I GEFYGATECNCS I ANMD : 400 
410 420 430 440 

t t , t i 1 t i t t 1 i t t I \ r t. i i 1 \ i i i ) i I ii T i r > ■ t 1 i i i i J 



GKVGSCGFNSR ILTHVYPIRLVKVNEOTMEPLRDSEGLC I 440 
PCGPGEPGLLVGQi NGGDPLRRFDGYVSOSATNKK I AHSV 480 
FRKGDSAYLSGDVIVMDELGYMYFRDRSGDTFRWRGENVS 520 
TTEVEAVLSRLLGQTDVAVYGVAVPGVEGKAGMAAI ADPH 560 
SQLDPNSMYQELQK VLASYARP I FLRLLPQVDTTGTFK I Q 600 

610 620 630 640 

■ ■ ■ ■ 1 ' * * ■ * ■ * * ■ 1 * ' ■ * i ' 1 ■ ■ f ' 1 1 < t ■ ■ < 1 1 > ■ > * \ 



KTRLQREGFDPRQTSDRLFFLDLKSGTRYLPLDERVHAR I 640 
CAGDFSL 647 
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mmFATP2 full length,DNA 



10 20 30 40 

■ ■ » * i ■ f ■ ' f * * ■ « ( ■ ■ * ■ I > 1 1 1 1 ' * * ' J t i i i i i i i i i 

GGGCGGAGGCCGAGCCCAGTCGCCAGCTCCTGCTCTGCTC 40 
CTCTCCCGCCTGCCGCCGCGCTGCACGCCTCGAGCACTCC 80 
CTCGGCCCCGGCGGGGACCGGGGACCCCGCAGCTACCGCC 120 
ATGCTGCCAGTGCTCTACACCGGCCTGGCGGGGCTGCTGC 160 
TGCTGCCTCTGCTGCtCACCTGCTGCTGCCCCTACCTCCT 200 

210 220 230 240 
. . . . I » * j . 1 i i > > I > » i r 1 i i i i.L, ,..,),,,,! 

CCAAGATGTGCGGTACTTCCTGCGGCTGGCCAACATGGCC 240 
CGGCGGGTGCGCAGCTACCGGCAGCGGCGACCCGTGCGTA 280 
CCATCCTGCGGGCCTTCCTGGAACAAGCGCGCAAGACCCC 320 
ACACAAGCCCTTCCTGCTGTTCCGAGACGAGACGCTCACC 360 
TACGCCCAGGTGGACCGGCGCAGCAACCAAGTGGCGCGGG . 400 

410 420 430 440 

» t i t 1 i t i i 1. t i i i I i i i"i I i i r i I 1 I t f L I i i i 1 I i i i 1 

CGCTGCACGATCAACTGGGCCTACGACAGGGGGATTGCGT 440 
AGCCCTCTTCATGGGCAATGAGCCG6CCTACGTGTGGATC 480 
TGGCTGGGACTGCTCAAA'CTGGGCTGTCCCATGGCGTGCC 520 
TCAACTACAACATTCGTGCCAAGTCTCTGCTGCACTGCTT 560 
TCAATGCTGCGGGGCGAAGGTGCTGCTGGCCTCCCCAGAT 600 

610 620 630 " 640 

rfttliiffttfrfil i t i t ) Itttl lltllttlll 

CTACAAGAAGCTGTGGAGGAGGTTCTTCCAACCCTGAAAA 640 
AGGATGCCGTGTCCGTCTtTTACGTAAGCAGAACTTCTAA 680 
CACAAATGGTGTGGACACAATACTGGACAAAGTAGACGGA 720 
GTGTCGGCGGAACCCACCCCGGAGTCGTGGAGGTCTGAAG-760 
TCACTTTTACCACGCCAGCAGTATACATTTATACTTCGGG 800 

810 820 830 840 

' t t t I ■ ■ ■ * \ ■ i i t 1 i i ■ r t t i i r t i i i i [ t 1 i i I i t t i t 

AACCACAGGTCTTCCAAAAAGCGGAACCATCAATCATCAT- 840 
CGCCTAAGGTATGGGACAAGCCTTGCTATGTCGAGTGGGA 880 
ATCACGGCCAAGGATGTCATCTATACCAACAATGCCCCTG 920 
TTCCAACAGTGCAACGCTCAAGATCGGCCTTCACGGATGC 960 
ATCCTGGGTTGGGGCTACTTTAACC TTGGCGGGGC AAA TT 1000 

1010 1020 1030 1040 

■ « ' ' 1 ' J t t 1 . i j i ( I t . r . I . r . t I t t f f I I I t 1 1 I I t I \ 

CTCAAGCAAGCCAATTTTGGGAACGACTGGCAGGAAATAC 1040 
AACGTCAACGGTCATTCAGTACATTGGTGAACTGCTTCGG 1080 
TACC TGTGCAACACACCGCAGAAACCAAATGACCGGGACC 1 120 
ACAAAGTGAAAAAAGCCCTGGGAAATGGCTTACGAGGAGA 1160 
TG TGTGGAGAG AG TTC A TC A AGAG AT TTGGGGACATCCAC 1200 
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mmFATP2 full length.PNA 

1 2 1 0 1220 1230 1240 

_■ t ■ t i ■ < ■ ■ i * * ■ ■ ' i i i i i i i < i i t t i t f i i i ■ \ t i , , i 
G TG T A TG AG 1" TC T ACGC A TCC AC TG A AGGC A AC AT TGG AT 1240 
TTGTGAACTATCCAAGGAAAATCGGTGCTGTCGGGAGAGC 1280 
AAACTACCTACAAAGAAAAGTTGCAAGGTATGAGCTGATC 1320 
AAGTATGACGTGGAGAAGGACGAGCCGGTCCGTGACGCAA !360 
ATGGATATTGCATCAAAGTCCCCAAAGGTGAGGTTGGACT 1400 

1410 1420 1430 1440 

■ * ' ■ * « ' * l 1 ' * ' ' I * ' ' F I > ' ■ ' t t I I I 1 I I I t I I T 1 t f 

C TTGG T TTGC AAA A TC AC AC AG CTC AC ACC A TTTATTGGC 1440 

TATGCTGGAGGAAAGACCCAGACAGAGAAGAAAAAACTCA 1480 

GAGATGTCTTTAAGAAAGGCGACATCTACTTCAACAGCGG 1520 

AGACCTCCTGATGATCGACCGTGAGAACTTCGTCTACTTT 1560 

CACGACAGGGTTGGAGATACTTTCCGGTGGAAAGGAGAGA 1600 

1610 1620 1630 1640 

■ « ■ * * > ■ » ■ 1 t « « t / r t t F i i i i I t t i i J i i t t f 

ACGTAGCTACCACAGAAGTGGCTGACATCGTGGGACTGGT" 1640 
AGATTTTGTTGAAGAAGTGAATGTGTATGGCGTGCCTGTG 1680 
CCAGGTCATGAGGGTCGAATTGGGATGGCCTCCCTCAAGA 1720 
TCAAAGAAAACTACGAGTTCAATGGAAAGAAACTCTTTCA 1760 
ACACATCGCGGAGTACCTGCCCAGTTACGCGAGGCCTCGG 1800 

1810 1820. 183Q 1840 

» » i ■ i ■ i t t l i » ■ t i « . . ■ I » i i t t t i i t t * t j i i i i i' i i 

TTCC TGAGG AT AC AAG A T ACC AT TG AG A TC AC TGGG AC TT 1840 
TTAAACACCGCAAAGTGACCCTGATGGAAGAGGGCTTCAA 1880 
TCCCACAGTCATCAAAGATACCTTGTATTTCATGGATGAT. 1920 
GCAGAGAAAACATTTGTGCC'CATGACTGAGAACATTTAtA 1960 
ATGCCATAATTGATAAAACTCTGAAGC TCTGAATATTCCC 2000 

2010 2020 2030 2040 

■ . . . I . ■ ■ ■ f ■ i f ■ t «iit t ii i i f it f i 1 i i i r I i i t if 

TGGTGGTTTAGCTCATGACATTTCCAGAAAGAAACTCGAT 2040 
AGACCTCGCAGAGCCACTTCATACGTAGAATCCAACTTTA 2080 
ACTTGATTGAAGACTATAAGGTGCGATTTTATTTTTAGGA 2120 
AATTATTCATTAAAAGGATAGTTTTTTTTTTTTTTT'TTAA 2160 
TTACACCTGAACCTTTGCAAGTAAAAAGATTTAGAGACAA 2200 
2210 2220 2230 2240 

* ■ ■ » I > ■ » i r < t i r ■ 1 ' ' ■ ' 1 ■ ' ' ' * ' ■ ; ' ' 1 « 1 ' 1 1 ■ ' ■ ■ 1 

TTATTTTTCAATGTGC ACC TGCC A TTTG TCC TTGC A AACT 2240 
AAGCTTCTTGGAGAGAGGGCCTTATTTTTTTAAAGACATA 2280 
ATAAACTATATTAACACTAAAAAAAAAAAAAAAAAAAAAA 2320 
AAAAAAAAAAAAAAAAAA 2338 



fc q ,648 
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mmFATP2 full length, protein 



10 20 30 40 

.1 i f..i 1 ) i i ) 1 ? f ) I f f f i f t i r i i I i r i i 1 i r i i 1 t t r i I 

MLPVLYTGLAGLLLL PLLITCCCPYLLOOVRYFLRLANMA 40 
RRVRSYRQRRPVRT I LRAFLEQARKTPHKPFLLFRDETLT 80 
YAQVDRRSNQVARALHDGLGLRQGDCVALFMGNEPAYVW I 120 
WLGLLKLGCPMACLNYN IRAKSLLHCFGCCGAKVLLASPO 160 
LGEA V EE VLPTLK KDA VS VFY VSR TSNTNGVOT I LOK VDG 200 

210 220 230 240 

■ i » « > ■ « ■ t I i i i t i t T t t l r i -i i r t t i f 1 1 i i i i i t i t \ 

VSAEPTPESWRSEVTFTTPAVY I YTSGTTGLPKSGTINHH 240 
RLRYGTSLAMSSGNHGQGCHLYQQCPCSNSATLK IGLHGC 280 
I LGWGYFNLGGANSQASQr WERLAGNTTSTV I QY I GELLR 320 
YLCNTPQKPNDROHK VKKALGNGLRGOVWREF I KRFGD I H 360 
VYEFYASTEGN I GFVNYPRK I GAVGRANYLQRK VARYEL I 400 

410 420 430 440 

■ * « * * . ... I ... t r .i i . f i i « i t I i . . * f . . . . { 

KYDVEKDEPVRDANGYC I KVPKGEVGILVCK I TQLTPF I G 440 
YAGGKTQTEKKKLRDVFKKGD I YFNSGDLLM I DRENFVYF 480 
HDRVGDTFRWKGENVATTEVAD I VGLVDFVEEVNVYGVP V 520 
PGHEGR IGMASLK IKENYEFNGKKLFGH IAEYLPSYARPR 560 
FLR IQOT I E I TGTFKHRKVTLMEEGFNPTV I KDTLYFMDO 600 

610 620 630 640 

■ ' - t * t - ■ ■ « t t i . i . I . . ■ ifiriiliiiiftiiMi.tif 

AEKTFVPMTEN1 YNA I IDKTLKL. 624 
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mmFATF3 partial.DNA 



10 20 30 40 

* ■ ■ ■ * f ' ■ * * ! * 1 1 * 1 ' 1 * ' 1 » * ■ 1 { 1 ■ ■ 1 * ■ ' ■ 1 T 



GAAAGCTCTGAGAGCGGGTGCAGTCTGGCCTGGCGTCTCG 40 
CGTACCTGGCCCGGGAGCAGCCGACACACACCTTCCTCAT 80 
CCACGGC6CGCAGCGCTTTAGCTACGCGGAGGCTGAGCGC 120 
GAGAGCAACCGGATTGCTCGCGCCTTTCTGCGCGCACGGG 160 
GCTGGACCGGGGGCCGCCGAGGCTCGGGCAGGGGCAGCAC 200 
210 220 230 240 

t 1 I I I « 1 1 I I »• t I I I I I t I 1 I I I I 1. I I I I I t I I I I t_ L . I t 1 



TGAGGAAGGCGCACGCGTGGCGCCTCCGGCTGGAGATGCG 240 
GCTGCTAGAGGGACGACCGCGCCCCCTCTGGCACCCGGGG 280 
CGACCGTGGCGCTGCTCCTCCCAGCGGGCCCGGATTTCCT 320 
TTGGATTTGGTTCGGACTGGCCAAAGCTGGCCTGCGCACG 360 
GCCTTTGTGCCCACCGCTTTACGCCGAGGACCCCTGCTGC 400 
410 420 430 440 

' ' » ' * ' * ' t I t l i l I t 1 f t I t t I , I I 1 I I I I I t . 1 I I l . r I 



ACTGCCTCCGCAGCTGCGGTGCGAGTGCGCTCGTGCTGGC" 440 
CACAGAGTTCCTGGAGTCCCTGGAGCCGGACCTGCCGGCC 480 
TTGAGAGCCATGGGGCTCCACCTATGGGCGACGGGCCCTG 520 
AAACTAATGTAGCTGGAATCAGGAATTTGCTATCGGAAGC 560 
AGCAGACCAAGTGGATGAGCCAGTGCCGGGGTACCTCTGT 600 

610 620 630 640 

■ » « ' ' . . t i i ■ t i ■ t > i t t 1 t 1 t i i i. i i i I i i t t i , . t . i 



GCCCCCCAGAACATAATGGACACCTGCCTGTACATCTTCA 640 
CCTCTGGCACTACTGGCCTGCCCAAGGCTGCTCGAATCAG 680 
TCATCTGAAGGTTCTACAGTGCCAGGGATTCTACCATCTG 720 
TGTGGAGTCCACCAGGAGGAGGTGATCTACCTCGCACTCC 760 
CAC TG TAC C AC ATGTCTG.GCTCCCTTCTGGGC A TTGTGGG 800 

810 820 830 840 
* ' * » 1 > i i *- 1 i i i i i i i i t 1 i i i i 1 i i t i I i i 1 . i > , i . t 



CTGCTTGGGCATTGGGGCCACCGTGGTGCTGAAACCC A AG 840 
TTC TC AGCTAGCC AG TTCTGGGACG AT TGCCAGAAACAC A 880 
GGGTGACAGTGTTCC AGTACATTGGGGAGTTGTGCCGATA 920 
CC TCG TC A ACC AGC CC CCGAGCAAGGC AG AG TTTGACC AT 960 
AAGGTGCGCTTGGCAGTGGGCAGTGGGTTGCGCCC AGACA 1000 

1010 1020 1030 1040 

* ' ' ' 1 V ' - » ■ 1 ■ ' ' ' 1 ' ' * ' 1 ' f * ' * ' 1 ' ' 1 ' ' 1 * * ' ' ' ' * „ 



CCTGGGAGCGTTTCCTGCGGCGATTTGGACCTCTGCAGAT 1040 
AC TGG AGACG T A TGGCATG AC AG AGGGC A ACGT AGC T AC G 1080 
TTCAATTACACAGGACGGCAGGGTGCAGTGGGGCGAGCTT 1 120 
CCTGGCTTTACAAGCACATCTTCCCCTTCTCCTTGATTCG t 160 
A T ACGATGTC A TG AC AGGGG AGC CT ATT CGGAATGCCC AG 1200 



WO 99/36537 



PCT/US99/00182 



79/117 



mmFATP3 partial. DN A 



1210 




1240 



i 



GGGCACTGCATGACCACATCTCCAGGTGAGCCAGGCCTAC 1240 

TGGTGGCCCCAGTGAGCCAGCAGTCCCCCTTCCTGGGCTA 1280 

TGC TGGGGC TCCGGAGC TGGCC A AGG AC AAGC TGC TG A AG 1320 

GATGTCTTCTGGTCTGGGGACGTTTTCTTCAATACTGGGG 1360 

ACC TC TTGG TC TG TG A TGAGCAAGGCTTTCTTC AC TTCC A 1400 

1410 1420 1430 1440 



CGATCGTACTGGAGACACCATCAGGTGGAAGGGAGAGAAT 1 440 
GTGGCCACAACTGAAGTGGCTGAGGTCTTGGAGACCCTGG 1480 
ACTTCCTTCAGGAGGTGAACATCTATGGAGTCACGGTGCC 1520 
AGGGCACGAAGGCAGGGCAGGCATGGCGGCCTTGGCTCTG 1560 
CGGC C CCCGC AG GCTCTG A ACC TGG TGC AGCTCTACAGCC 1600 

1610 1620 1630 1640 

I. I I 1 I t I t t \ I I 1 1 I I 1 I , T 1 1 1 I I t I I 1 I , I I 11 I f 1 I t I i 

ATGTTTCTGAGAACTTGCCACCGTATGCCCGACCTCGGTT 1640 

TCTCAGGCTCCAGGAATCTTTGGCCACTACTGAGACCTTC 1680 

AAACAGCAGAAGGTTAGGATGGCCAATGAGGGCTTTGACC 1720 

CCAGTGTACTGTCTGACCCACTCT'ATGTTCTGGACCAAGA 1760 

TATAGGGGCCTACCTGCCCCTCACACCTGCCCGGTACAGT 1800 

1810 1820 1830 1840 

« i * i. I * » > ' I t t t r \ t s t t \ i i i i 1 t f i i I ) i I r I i , t i ) 

GCCCTCCTGTCTGGAGACCTTCGAATCTGAAACCTTCCAC 1840 
TTGAGGGAGGGGCTCGGAGGGTACAGGCCACCATGGCTGC 1880 
ACCAGGGAGGG7TTTCGGGTATCTTTTGTATATGGAGTCA 1920 
TTATTTTGTAATAAACAGCTGGAGCTTAAAAAAAAAAAAA 1960 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 1998 



i i I i 1 i i i r I t t i i L i i i i 1 i i t i I i i 



i i i i 1 i i i i I 
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mmFATP3 partiaLprotein 



10 20 30 40 - 

* < < < * ■ * * « ) . ... t .... i . .-. , i ,<.,), , , , r . r . . i 

ESSESGCSLAWRLAYLAREOPTHTFL IHGAGRFSYAEAER 40 
ESNRI ARAFLRARGWTGGRRGSGRGSTEEGARVAPPAGDA 80 
AARGTTAPPLAPGATVALLLPAGPDFLWI WFGLAKAGLRT 120 
AFVPTALRRGPLLHCLRSCGASALVLATEFLESLEPOLPA 160 
LRAMGLHLWATGPETNVAG I SNLLSEAADQVDEP VPGYLS 200 
210 220 230 240 

I I I 1 I T 1 I ,1 I 1 1 1 I f , I l„ f t f 1 t f : I \ I I I t f t 1 1 t f 1 | 1 T I 

APQNIMDTCLY IFTSGTTGLPKAAR ISHLKVLQCQGFYHL .240 
CGVHQEDV I YLALPLYHMSGSLLG I VGCLG I GATV VLKPK 280 
FSASGFWDOCQKHRVTVFQY I GELCRYLVNQPPSKAEFDH 320 
KVRLAVGSGLRPDTWERFLRRFGPLOILETTGMTEGNVAT 360 
FNYTGRGGAVGRASWL YKH I FPFSL I R YDVMTGEP I RNAQ- 400 

410 420 430 440 

■ ■ < i i i i i i f ■ t t i i i t * i 1 i i t i i i i i i . I i .t > i i t t i i f 

GHCMTTSPGEPGLLVAPVSQQSPFLGYAGAPELAKDKLLK 440 
DVFWSGDVFFNTGOLLVCQEQGFLHFHORTGDT I RWKGEN .480 
VATTEVAEVLETLDFLQEVN I YGVTVPGHEGRAGMAALAL 520 
RPPQALNLVGLYSHVSENLPPY ARPRFLRLQESLATTETF 560 
KQQKVRMANEGFDPSVL.SDPLYVLDGDIGAYLPLTPARYS 600 

610 620 630 640 
. . . . i ■ . . . t . , . , i . . . . i ■ , . , t . . , , t . , , . i , . . . ( 

ALLSGDLRI. 610 



Fr«. 6>7 
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mmFATP4 full length.DNA 



10 20 30 40 

■ « ■ > i ■ « ■ i 1 * ■ ■ ■ t ' ■ » » t * ■ ' ' * ■ * ' * t * ■ ■ * f f * * t i 
ATGCTGCTTGGAGCCTCTCTGGTGGGGGCGCTACTGTTCT 40 
CCAAGCTAGTGCTGAAGCTG'CCCTGGACCCAGGTGGGATT 80 
CTCCCTGTTGCTCCTGTACTTGGGGTCTGGTGGCTGGCGT 120 
TTCATCCGGGTCTTCATCAAGACGGTCAGGAGAGATATCT 160 
TTGGTGGCATGGTGCTCCTGAAGGTGAAGACCAAGGTGCG' 200 

210 220 230 240 

■ * ' ■ 1 ' * * * * * * » » * * ■ » * l * > * 1 1 > * 1 > * * * * ' 1 - * * t i 
ACGGTACCTTCAGGAGCGGAAGACGGTGCCCCTGCTGTTT 240 
GCTTCAATGGTACAGCGCCACCCGGACAAGACAGCCCTGA 280 
TTTTCGAGGGCACAGACACTCACTGGACCTTCCGCCAGCT 320 
GGATGAGT AC TCC AG TAG TGTGGCC A AC TTCCTGCAGGCC 360 
CGGGGCC TGGC CTCAGGC A A TGTAGTTGCCCTCTTT A TGG 400 

410 420 430 440 

■ ■ »--» 1 1 I I I f I I I t I f 1 I « f I t I I 1 I I 1 1 f I « i r I . . . , 1 

AAAACCGCAATGAGTTTGTGGGTCTGTGGCTAGGCATGGC 440 
CAAGCTGGGCGTGGAGGCGGCTCTCATCAACACCAACCTT 480 
AGGCGGGATGCCCTGCGCCACTGTCTTGACACCTCAAAGG 520 
CACGAGCTCTCATCTTTGGCAGTGAGATGGCCTCAGCTAT 560 
CTGTGAGATCCATGC7AGCCTGGAGCCCACACTCAGCCTC 600 
610 620 630 1 640 

t r * t 1 ' i l ' I i t i r ) I I l I [ I t ! I \ l I t l ] t , , ^ \ , , , , \ 

TTCTGCTCTGGATCCTGGGAGCCCAGCACAGTGCCCGTCA 640 
GCACAGAGCATCTGGACCCTCTTCTGGAAGATGCCCCGAA' 680 
GCACCTGCCCAGTCACCCAGACAAGGGTTTTACAGATAAG 720 
CTCTTCTACATCTACACATCGGGCACCACGGGGCTACCCA 760 
AAGCTGCCATTGTGGTGCACAGCAGGTATTATCGTATGGC 800 

810 820 830 840 
.... i * . r .... i t * . I .... i ... ,_i 

TTCCCTGGTGTACTATGGA-T-TCCGCATGCGGCCTGATGAC 840 
ATTGTCTATGACTGCCTCCG'COTCTACCACTCAAGCAGGA 880 
AAC ATCGTGGGG A T TGGC AG T GC TT AC TCCACGGC A TG AC 920 
TG TGG TG A TCCGG A AG A AG TTC TCAGCCTCCCGGTTCTGG 960 
GATGATTGTATCAAGTACAACTGCACAGTGGTACAGTACA 1000 
1010 1020 1030 1040 

" ' ' ' < ' « ■ I t * t . * > f » » I > . i i I « i « . > r * i . I .... I 

TTGGCGAGCTCTGCCGCTACCTCCTGAACCAGCCACCCCG 1040 
TGAGGCTGAGTCTCGGCACAAGGTGCGCATGGCACTGGGC 1080 
A ACGG TC TCCGG C AG TCC A TC TGG ACC GAC TTC TCC AG CC 1 120 
GTTTCCACATCCCCCAGGTGGCTGAGTTCTATGGGGCCAC 1 160 
TGAATGCAACTGTAGCCTGGGC AACTTTGACAGCCGGGTG 1200 



hv C2A 
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mmFAT F4 full length.DNA 



1210 1220 1230 1240 

■ > 1 < ■ ' ■ 1 > ■ ■ » ' ■ * * « t 1 ' * ' i ■ ■ * ' 1 ' « < * ' « ■ ■ ■ i 



GGGGCCTGTGGCTTCAATAGCCGCATCCTGTCCTTTGTGT 1240 

ACCCTATCCGTTTGGTACGTGTCAATGAGGATACCATGGA 1280 

ACTGATCCGGGGACCCGATGGAGTCTGCATTCCCTGTCAA 1320 

CCAGGTCAGCCAGGCCAGCTGGTGGGTCGCATCATCCAGC 1360 

AGGACCCTCTGCGCCGTTTCGACGGGTACCTCAACCAGGG 1400 

1410 1420 1430 1.440 
* ' ' * ' * * « * 1 t i « i . i t « f -i J t t t t I f . t * t l t t t i i , , < , ) 

TGCCAACAACAAGAAGATTGC TAATGATGT'CTTCAAGAAG 1 440 
GGGGACCAAGCCTACCTCACTGGTGACGTGCTGGTGATGG 1480 
ATGAGCTGGGTTACCTGTACTTCCGAGATCGCACTGGGGA 1520 
C ACGTTCCGCTGG A A AGGGG AG A ATGT A TCTACC AC TGAG 1560 
GTGGAGGGCACACTCAGCCGCCTGC TTCATATGGCAGATG 1600 

1610 1620 1630 1640 

I » i 1 * t I t t I 1 I I I f I I I ■ I I I I I I I I »• L I ■ I I t I I- t I I- I I 1 



TGGCAGTTTATGGTGTTGAGGTGCCAGGAACTGAAGGCCG 1640 
AGCAGGAATGGCTGCCGT'TGCAAGTCCCATCAGCAACrGT 1680 
GACCTGGAGAGCTTTGCACAGAGCTTGAAAAAGGAGGTGC 1720 
CTCTGTATGCCCGCCCCATCTTCCTGCGCTTCrrGCCTGA 1760 
GC TGCAC A AG AC AGGGACCTTCAAGTTCCAGA AG AC AGAG 1800 

1810 1820 1830 1840 

)..t 1 I f J 1 1 I f I T < I ! It / I f t I I i I I I I 1 I 1 I i i t t I 1 1 f 



TTGCGG AAGG AGGGC TTTG AC CC A TCTGTTGTGAA AG AC C 1840 
CGCTGTTCTATCTGGATGCTCGGAAGGGCTGCTACGTTGC 1880 
ACTGGACCAGGAGGCCTATACCGGCATCCAGGCAGGCGAG- 1920 
GAGAAGCTGTGATTTCCCCCTACATCCCTCTGAGGGCCAG. 1960 
AAGATGCTGGATTCAGAGCCCTAGCGTCCACCCCAGAGGG 2000 

2010 2020 2030 2040 

f , ... t , i . i . . . . I . » . ■ i . , . . t 



TCCTGGGCAATGCCAGACCAAAGCTAGCAGGGCCCGCACC 2040 
TCCGCCCCTAGGTGCTGATCTCCCCTCTCCCAAACTGCCA 2080 
AG TG AC TC AC TGCCGCTTCCCCGACCCTCC AG AGGCTTTC 2120 
TGTGAAAGTCTCATCCAAGCTGTGTCTTCTGGTCCAGGCG 2160 
TGGCCCCTGGCCCCAGGGrTTCTGATAGGCTCCTrTAGGA 2200 

2210 2220 2230 .2240 

■ ■ ■ > t * ■ ■ » * * * * ■ 1 « ■ * ■ 1 * < * * 1 1 * * ■ 1 ■ ■ * ■ * ■ ' * ' 1 



TGGTATCTTGGGTCCAGCGGGCCAGGGTGTGGGAGAGGAG.2240 
TCACTAAGATCCCTCCAATCAGAAGGGAGCTTACAAAGGA 2280 
ACCAAGGCAAAGCCTGTAGACTCAGGAAGCTAAGTGGCCA 2320 
GAGACTATAGTGGCCAGTCATCCCATGTCCACAGAGGATC 2360 
TTGGTCCAGAGCT6CCAAAGTGTCACCTCTCCCT6CCTGC 2400 
2410 < 2420 -2430 2440 

i t i t 1 r. r t i 1 i i i i 1 i i i t I i t i i I i i, i i 1 i i i \ \ i i i i I 



ACCTCTGGGGAAAAGAGGACAGCATGTGGCCACTGGGCAC 2440 
C TG TC TCA AG AAG TCAGG A TC AC AC AC TC AG TCCTTGTTT 2480 
CTCCAGGTTCCCTTGTTCTTGTCTCGGGGAGGGAGGGACG 2520 
AGTGTCCTGTCTGTCCTTCC TGCCTGTCTGTGAGTCTGTG 2560 
TTGCTTCTCCATCTGTCCTAGCCTGAGTGTGGGTGGAACA 2600 



R 4 . 6$£> 



WO 99/36537 



PCT/US99/00182 



83/117 

mmFATP4 full length.DNA 

2610 2620 2630 2640 

■ ■ * ■ i * * * ' 1 * * * ■ 1 > « ■ * 1 * > ■ * 1 * * > ■ I * < ■ » 1 ■ * ■ ■ I 
GGCATGAGGAGAGTGTGGCTCAGGGGCCAATAAACTCTGC 2640 
CTTGACTCCTCTTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2680 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 27 1 0 
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mmFATP4 full length.protein 



10 20 * 30 40 

I i I I I t 1 I l I f 1 1 I I I I t 1 T I 1 t l i I I f 1 1 T I 1 I t I t f 1 1 t . f 

MLLGASLVGALLFSKLVLKLPWTQVGFSLLLLYLGSGGWR 40 
F IRVF I KTVRRO I FGGMVLLKVKTKVRRYLGERKTVPLLF 80 
ASMVQRHPDKTAL IFEGTDTHWTFRGLDEYSSSVANFLGA 120 
RGLASGNVVALFMENRNEFVGLWLGMAKLGVEAAL I NTNL 160 
RRDALRHCLDTSKARAL I FGSEMASA I CE I HASLEPTLSL 200 

210 220 230 240 

i i i i I i i i i t i i i i t i - i i i I t i i i f i i i i f i i i t f i i i i I . 

FCSGSWEPSTVPVSTEHLDPLLEDAPKHLPSHPOKGFTDK 240 
LFY I YTSGTTGLPKAAI VVHSRYYRMASLVYYGFRMRPDO 280 
I VYOCLPLYHSSRKHRGDWQCLLHGMTVV I RKKFSASRFW 320 
OOC IKYNCTVVQY IGELCRYLLNGPPREAESRHKVRMALG 360 
NGLRGSIWTDFSSRFHIPQVAEFYGATECNCSLGNFDSRV' 400 

410 > 420 430 440 

»-< ■ « 1 ' » » * * 1 * ■ ■ 1 ' 1 * ' I * 1 ■ ■ ( ■ 1 ■ 1 * ■ ' « ■ 1 ' ■ * * * 
GACGFNSR TLSFVYP IRLVRVNEDTMEL IRGPDGVC IPCQ 440 
PGQPGQLVGRI I QQDPLRRFOG YLNGGANNKK I ANOVFKK 480 
GDOAYLTGOVLVMOELGYLYFRORTGOTFRWKGENVSTTE 520 
VEGTLSRLLHMAOVAVYGV EVP GTEGRAGMAAV ASP I SNC 560 
DLESFAGTLKKELPLYARP fFLRFLPELHKTGTFKFQKTE 600 
610 620 630 640 

' ■ * » ' 1 ' ' 1 1 1 ' ' 1 1 1 ■ ■ 1 F 1 1 ' ' 1 ' 1 ■ 1 f 1 ' 1 ' ( 1 ' * ' I 
LRKEGFDPSVVKDPLFYLDARKGCYVALDQEAYTR 1 QAGE 640 
EKL 644 
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mmFAT F5 full length.DNA __ 



10. 20 30 40 

■ » ' * i t . . . l i i i t i i i t i f i < i i i t ) i i ( i < . t i i t ■ i i 

C ACTC A TC AG AGC T A AGAG AG ACT AC ACGCTCTC A TCTAC 40 
TTCAGAAAGAGCCAATGCCATGGGTATTTGGAAGAAACTA 80 
ACCTTACrGCTGTTGCTGCTTCTGCTGGTTGGCCTGGGGC 120 
AGCCCCCATGGCCAGCAGCTATGGCTCTGGCCCTGCGTTG 160 
GTTCCTGGGAGACCCCACATGCCTTGTGCTGCTTGGCTTG 200 

210 220 ' 230 240 

' 1 ' ■ 1 * « 1 ' 1 i i i i I i t t . I * t- t i I i t i t 1 ttt.! i i i i I 

GCATTGCTGGGCAGACCCTGGArCAGCTCCTGGATGCCCC 240 
ACTGGCTGAGCCTGGTAGGAGCAGC TCTTACCTTATTCCT 280 
ATTGCCTCTACAGCCACCCCCAGGGCTACGCTGGCTGCAT 320 
A A AG A TG TGGC TT TC ACCTTC A AG A TGCTTTTCT A TGGCC 360 
TAAAGTTCAGGCGACGCCTT AACAAACATCCTCCAGAGA'C 400 

410 420 430 440 

* * * ' * ' ' 1 ' 1 * ' > > t ' ' * * i > 1 1 '■■ iitti\iitt)tiiri 
CTTTGTGGATGCTTTAGAGCGGCAAGCACTGGCATGGCCT 440 
GACCGGGTGGCCTTGGTGTGTACTGGGTCTGAGGGCTCCT 480 
CAATCACAAATAGCCAGCTGGATGCCAGGTCCTGTCAGGC 520 
AGCATGGGTCCTGAAAGCAAAGCTGAAGGATGCCGTAATC 560 
CAGAACACAAGAGATGCTGCTGCTATCTTAGTTCTCCCGT 600 

610 620 630 640 

t t i t f t i f i I t ( t f f i r f , i f f f f t f r t i f f f t i ,.f I i i i i J 

CCAAGACCATTTCTGC'TTTGAGTGTGTTTCTGGGGTTGGC 640 
C AAG T TGGGC TGCCC TG TGGCC TGG A TC A A TCC AC AC AGC 680 
CGAGGGATGCCCTTGCTACACTCTGTACGGAGCTCTGGGG 720 
CC AG TG TGC TG A T TG TGG A TC C AG AC C TCC AGG AG A A CCT 760 
GGAAGAAGTCCTTCCCAAGCTGCTAGCTGAGAACATTCAC 800 
810 820 830 840 

1 1 1 1 I I 1 1 I I 1 I I I 1 I I 1 1 f t I I I I 1 t I 1 I I 1 T 1 ( 1 I 1 I f 

TGCTTCTACCTTGGCCACAGCTCACCCACCCCGGGAGTAG 840' 
AGGC TC TGGG AGC T TCC C TGG A TGC TGC ACCTTCTG AC CC 880 
AGTACCTGCCAGCCTTCGAGC TACGATTAAGTGGAAATCT 920 
CCTGCCATATTCATCTTTACTTCAGGGACCACTGGACTCC 960 
CAAAGCCAGCCATCTTATCACATGAGCGGGTCATACAAGT 1000 

1010 1020 1030 1040 

,i i i ,i, I i i ,i i I i i i \ f \ i r \ t i i i ,i I i i t i I t t t i, L t t i i ( 

GAGC A ACG TGC TG TCC T TC TG TGG A TGC AG AGC TG A TG AT 1040 
GTGGTCTATGACGTCCTACCTCTGTACCATACGATAGGGC 1080 
TTG TCC TTGGA T TC C T TGG C TGC TTAC A AG T TGG AGCC AC 1 120 
CTGTGTCCTGGCCCCCAAGTTCTCTGCCTCCCGATTCTGG 1 160 
GCTGAGTGCCGGCAGCATGGCGTAACAGTGATCTTGTATG 1200 
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86/117 



1210 1220 1230 1240 

■ * I ■ ' > ' ' ■ * ' ' 1 ' ■ ■ ■ i 1 ■ * ' 1 ■ 1 i ■ 1 * ■ > ■ I 



TGGGTGAAATCCTGCGGTACTTGTGTAACGTCCCTGAGCA 1240 
ACCAGAAGACAAGATACATACAGTGCGCTTGGCCATGGGA 1280 
ACTGGACTTCGGGCAAATGTGTGGAAAAACTTCCAGCAAC 1320 
GCTTTGGTCCCATTCGGATCTGGGAATTCTACGGATCCAC 1360 
AGAGGGCAATG7GGGCTTAATGAACTATGTGGGCCACTGC 1400 

1410 1420 1430 1440 

■ » ■ ■ 1 * * ■ ' * * * * ■ i * ■ ' ' 1 ' * » ■ 1 ■ ■ » * t ■ ■ * * 1 * * > ■ i 



GGGGCTGTGGGAAGGACCAGCTGCATCCTTCGAATGCTGA 1440 

CTCCCTTTGAGCTTGTACAGTTCGACATAGAGACAGCAGA 1480 

GCCTCTGAGGGACAAACAGGGTTTTTGCATTCCTGTGGAG 1520 

CC AGGAA AGCC AGG AC T TC T TTTGACC A AGGTTCGAAAG A 1560 

ACCAACCCTTCCTGGGCTAOCGTGGTTCCCAGGCCGAGTC 1600 

1610 1620 1630 1640 

, <„ t i ( | y i , I J i ■ [ t > t ■ * i I ■ i t t I i i i i i i > t . f ) t i .1 f 



CAATCGGAAACTTGTTGCGAATGTAGGACGCGTAGGAGAC 1640 
CTGTACTTCAACACTGGGGACGfGCTGACCTTGGACCAGG 1680 
A AGGC T TC T TC TAG TTTC A AG AC CGCCTTGGTG AC ACCTT 1720 
CCGGTGGAAGGGCGAAAACGTATCTACTGGAGAGGTGGAG 1760 
TG TG TTTTG TCTAGCC TAG AC TTCCTAGAGGA AG TCAATG 1800 

1810 1820 1830 1840 
«» i i i «■» * 1 i i ■ i i * i i t 1 i i-> t i i i i i I i i i i i i i i i i 



TCTATGGTGTGCCTGTGCCAGGGTGTGAGGGTAAGGTTGG 1840 
CATGGCTGCTGTGAAACTGGCTCCTG'GGAAGACTTTTGAT" 1880 
GGGCAGAAGCTATACCAGCATGTCCGCTCCTGGCTCCCTG 1920 
CCTATGCCACACCTCATTTCATCGGTATCCAGGATTCCCT 1960 
GGAGATCACAAACACCTACAAGCTGGTAAAGTCACGGCTG 2000 
2010 2020 2030 2040 

.1.1 t I t. I t t \ \ T I I 1 I I I .1 f 1 T I I I I 1 I t t I I I 1 I 1 I 1 I I I 



GTGCGTGAGGGTTTTGATGTGGGGATCATTGCTGACCCCC 2040 
TCTACATACTGGACAACAAGGGCCAGACCTTCCGGAGTCT 2080 
GATGCCAGATGTGTACCAGGCTGTGTGTGAAGGAACCTGG 2 120 
AATCTCTGACCACCTAGCCAACTGGAAGGCAATCCAAAAG 2160 
TGTAG AG A TTGAC AC T AG TC AGC TTC AC A A AG TTGTCCGG 2200 

2210 2220 2230 2240 
> ■ < t t i . , . I . . . . i . t . . I > . i t i i * t ■ l i i t ■ i , 1 , » I . 



GTTCCAGATGCCCATGGCCCAGTAGTACTTAGAGAATAAA 2240 
CTTGAATGTGTATACAAAAAAAAAAAAAAAAAAAAAA 2277 



fy 70 B 
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mmFATPS fuHJength.protein 



10 20 30 40 

. . . . i » . i . t . . . » i . ■ . » i ■ « » ■ i ■ » » i I < n » t ■ ■ . ■ i 
MG I WKKLTLLLLLLLIVGLGQPPWPAAMALALRWFLGDPT 40 
CLVLLGLALLGRPWI SSWMPHWLSLVGAALTLFLLPLGPP 80 
PGLRWLHKDVAFTFKMLFYGLKFRRRLNKHPPETFVDALE 120 
RGALAWPDRVALVCTGSEGSS I TNSQLOARSCQAAWVLKA 160 
KLKDAV I QNTRDAAAILVLPSKT ISALSVFLGLAKLGCPV 200 

210 220 230 240 

•» ■ * ■ i » i t » I i ii i t r f it \ i i i t i t t 1. 1 1 it ft i i « i i J 

Atfl NPHSRGMPLLHSVRSSGASVL I VOPDLOENLEEVLPK 240 
LLAEN I HCFYLGHSSPTPGVEALGASLDAAPSDPVPASLR 280 
ATIKWKSPAI F IFTSGTTGLPKPAILSHERV IQVSNVLSF 320 
CGCRADDVVYDVLPLYHT I GL VLGFLGCLQ VGATCVLApK 360 
FSASRFWAECRQHGVTV I LYVGE I LRYLCNVPEGPEDK I H 400 

410 420 ■ 430 440 

* ■ ■ » 1 ' * ■ ■ 1 ' ■ * ■ f * * 1 1 * * ■ ■ ■ 1 ■ 1 ■ ■ i 1 ■ 1 ■ ' * ■ * 1 1 
TVRLAMGTGLRANVWKNFQQRFGP I R I WEFYGSTEGNVGL 440 
MNYVGHCGAVGRTSC I LRMLTPFELVGFD I ETAEPLROKQ 480 
GFC IPVEPGKPGLLLTK VRKNGPFLGYRGSGAESNRfCLVA 520 
NVRRVGDLYFNTGDVLTLDOEGFFYFQDRLGDTFRWKGEN 560 
VSTGEVECVLSSLDFLEEVNVYGVPVPGCEGKVGMAAVKL 600 

610 620 630 640 

i i i t i i i i i t i 1 iti t t i i i t I i i > i t t i i i t i t i i I i i i i J . 

APGKTFOGGKLYGHVRSWLPAY ATPHF I R I GDSLE I TNTY 640 
KLVKSRLVREGFOVGI I ADPLY I LONKAGTFRSLMPDVYG 680 
AVCEGTWNL. 690 



f<A- 71 
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dmFATP partiaLDNA 



10 20 30 40 
i . « . i . . . . I . i . » i . . . , i , , t , 1 i 

GCTCTCTGGGCCTATATCAAGCTGCTGAGGTACACGAAGC 40 
GCCATGAGCGGCTCAACTACACGGTGGCGGACGTCTTCGA 80 
ACGAAATGTTCAGGCCCATCCGGACAAGGTGGCTGTGGTC 120 
AGTGAGACGCAACGCTGGACCTTCCGTCAGGTGAACGAGC 160 
ATGCGAACAAGGTGGCCAArGTGCTGCAGGCTCAGGGCTA 200 

210 220 230 240 

■ ■ ■ ■ i 1 * * * l » * » « 1 * « * ■ i ■ * « * 1 ' * ' » f « ■ * ■ 1 * » ■ ■ i 
CAAAAAGGGCGATGTGGTGGCCCTGTTGCTGGAGAACCGC 240 
GCCGAGTACGTGGCCACCTGGCTGGGTCTCTCCAAGATCG 280 
GTGTGATCACACCGCTGATCAACACGAATCTGCGCGGTCC 320 
CTCCCTGCTGCACAGCATCACGGTGGCCCATTGCTCGGCT 360 
CTCATTTACGGCGAGGACTTCCTGGAAGCTGTCACCGACG 400 

410 420 430 440 

' 1 1 ' * ' ■ ' ' 1 * ' ' ' f ' ■ ' ■ t i -i t iLllllLlli i t t t i t f 

TGGCCAAGGATCTGCCAGCGAACCTCACAGTCTTCCAGTT 440 
CAACAACGAGAACAACAACAGCGAGACGGAAAAGAACATA 480 
CCGCAGGCCAAGAATCTGAACGCGCTGCTGACCACGGCCA 520 
GCTATGAGAAGCCTAACAAGACGCAGGTTAACCACCACGA 560 
CAAGCTGGTCTACATCTACACCTCCGGCACCACAGGAPTG 600 

610 620 630 640 

i * i- * i i i i i I i i > i i t i i i l i i i i i < i i i f i 1. 1 i i < i i t 1 
CCAAAGGCTGCGGTTATCTCTCACTCCCGTTATCTGTTTA 640 
rCGCTGCTGGCATCCACTACACCATGGGTTTCCAGGAGGA 680 
GGACATCTTCTACACGCCCTTGCCTTTGTACCACACCGCT 720 
GGTGGCATTATGTGCATGGGTCAGTCGGTGCTCTTTGGCT' 760 
CC ACGGTCTCC AT TCGG A AG A AG TTCTCGGCATCCAACTA 800 

810 820 830 840 

i > ■ i i i t i t l i * » i i i t t t l i i i t > i i i i I i i i > i i i « i I 

TTTCGCCGACTGCGCCAAGTATAATGCAACTATTGGTCAG 840 
TATATCGGTGAGATGGCTCGCTACATTCTAGCTACGAAAC 880 
CCTCGGAATACGACCAGAAACACCGAGTGCGTCTGGTCTT 920 
TGGAAACGGACTGCGACCGCAGATTTGGCCACAGTTTGTG 960 
CAGCGCTTCAACATTGCCAAGGTTGGCGAGTTCTACGGCG 1000 

1010 1020 1030 1040 

■ l.i-X-i. I i.t llLli i i ! i i. i i i i > i.i l. i ,j_ i_ i LllllLlli I I 

CCACCGAGGGTAATGCGAACATCATGAATCATGACAACAC 1040 
GGTGGGCGCCATCGGCTTTGTGTCGCGCATCCTGCCCAAG 1080 
ATCTACCCAATCTCGATCAT TCGCGCCGATCCGGACACCG 1 120 
GAGAGCCCATTAGAGATAGGAATGGCCTATGCCAACTGTG 1 160 
CGCTCCCAACGAGCCAGGCGTATTCArCGGCAAGATCGTC 1200 



fie.. 73- A 



WO 99/36537 



PCT/US99/00182 



droFATP partiaLDNA 



89/117 



1210 1220 1230 1240 

* * f * ■ ■ ' t < t t t t t i i i t t * t t \ i i i i t i . . t f 



AAAGGAAATCCTTCTCGCGAATTCCTCGGATACGTCGATG 1240 
AAAAGGCCTCCGCGAAGAAGATTGTTAAGGATGTGTTCAA 1280 
GCATGGCGATATGGCTTTCArCTCCGGAGATCTGCrGGTT 1320 
GCCGACGAGAAGGGTTATCTGTACTTCAAGGATCGCACCG 1360 
GTGACACCTTCCGC TGGAAGGGCGAGAATGTTTCCACCAG 1400 
1410 1420 1430 1440 

t.l I I.. I ■ « « « I i » I I I I 1 I 1 1 I I I I I I F I 1 I ) « ■ . I ■ .1 I I I _ 



CGAGGTGGAGGCGCAAGTCAGCAATGTGGCCGGTTACAAG 1440 

G A TACCG TCGTTT ACG G CGTAACC A "TTCCGC AC ACCGAGG 1480 

GAAGGGCCGGCATGGCCGCCATCTATGATCCGGAGCGAGA 1520 

ATTGGACCTCGACGTCTTCGCCGCTAGCTTGGCCAAGGTG 1560 

C TG CCC GC G T ACGC TCG TC CCC AG ATC A TTCG A TTGCTC A 1600 

1610 1620 1630 1640 
* » ' ' 1 ' ' ■ » 1 ■ ' ■ ■ 1 ' » » * 1 ' ' ' 1 I 1 1 > '» 1 ' * * ■ 1 ' * ■ r ! 



CCAAGGTGGACCTGACTGGAACCTTTAAGCTGCGCAAGGT 1640 
AGACCTGCAGAAGGAGGGCTACGATCCGAACGCGATCAAG 1680 
GACGCGCTGTACTACCAGAGTTCCAAGGGTCGGTACGAGC 1720 
TGC TCACGCCCC AGG 71 TACGACCAGGTGCAGCGC A ACG A 1760 
AArCCGCrTCTAAGAGCTGCAATAGAGTTGTGTCTGAACC 1800 
1810 1820 . 1830 1840^ 

■ ■ * ■ f ■ ■ ■ ■ 1 ■ ■ ■ * 1 » ■ 1 1 > » * * 1 f 1 T 1 ' F ' 1 ■ ' ' ■ ■ i t f 



TTGCCTTTTGCCCAATATGCTGTTAATTAGTTTGTAAGGC 1840 
TAAGTGTAGTAGAGGAAAATCGGGGGAAATCGGCAGCAAA 1880 
GATCATTCAGCCTAGGAGAGATGCATCCGAAGCACA.TTTC ' 1920 
CATGTCAACAATGCACT-TTTGTATATCGTAAGCATATATA I960 
TATCGTATATCGTAAACGtAGTTGTATCTGCATTTGTGTA 2000 
2010 2020 2030 2040 

I I 1 I [ I 1 1.1 f I I I 1 f I 1 I I I, I t I ,1 I I 1 I 1 I I I I I f I I |„|.t, 



GATGATAGCCTCCTATACGCATTTCAATTGTTTTTAGCGT 12040 
GCTAAAGAACCTTGTTAAATGCAATTTCAGCTATTGTTTA 2080 
GTCAGTTT TAG TGGC A TTTAC AC TTCC A TTCTCGTTGCGT 2120 
TTCGTTTTTGCCTG TACATATGAGAAGC TCTGATGT"TTTr 2160 
GTATCAAATAAAGTTTTTTCCTTCACCACGGACCACGTGA 2200 

2210 2220 2230 2240 
i i i i I i i i i f i i i t I i i i i f t i i i f i i i i f t i i i I i \ i, i, f 



AAAAAAAAAAAAAAAAAAAAA 2221 
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90/117 

dmFATP partial.orotein 



i 



10 20 30 40 

* < ». ■ t ■ ■ ■ ■ I » * ■ » t ' * * > f * * ' « * ' '-■ ■ J * » ' 1 * 1 ' * ' F 

ALWAYIKLLRYTKRHERLNYTVADVFERNVQAHPOKVAVV 40 
SETQRWTFRQVNEHANKVANVLQAQGYKKGOVVALLLENR* 80 
AEYVATWLGLSK EGV I TPL I NTNLRGPSLLHS I JVAHCSA 120 
L 1 YGEDFLEAVTDVAKDLPANLTLFQFNNENNNSETEKN I 160 
PQAKNLNALLTTASYEKPNKTGVNHHDKLVY I YTSGTTGL 200 

210 220 230 240 
« > > > * ' ' * 1 » t i t « > t i i * f l t i i i I i t ii I f it * \ t t > i I 

PKAAV I SHSRYLF I AAG I HYTMGF.QEED I FYTPLPLYHTA 240 
GGI MCMGQSVLFGSTVS I RKKFSASNYFADCAKYNAT IGG 280 
Y I GEMARY ILATKPSEYDQKHRVRLVFGNGLRPQ I WPQFV 320 
QRFN I AK VGEFYGATEGNAN I MNHONTVG A I GF VSR I LPK 360 
I YP IS I I RADPDTGEP I RDRNGLCQLCAPNEPG VF I GK I V 400 
410 420 - 430 440 

1 1 i i \ t i i i t i i i i 1 i i t i f i i 7 i f i Li if \ i i i I i i i i t 

KGNPSREFLGYVDEKASAKKI VKDVFKHGDMAF ? SGDLLV 440. 
ADEKGYLYFKDRTGDTFRWKGENVSTSEVEA.QVSNVAGYK, 480 
OT VVYGVT IPHTEGRAGMAAI YOPERELDLDVF AASLAKV 520 
LPAYARPQ I IRLLTKVDLTGTFKLRKVDLQKEGYDPNA I K 560 
OALYYQTSKGRYEt-LTPGVYDGVQRNE IRF 590 



WO 99/36537 
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91/117 

drFATP partial.DNA 



10 .20 30 40 

' ' ■ ' I t t i 1 I r t i f I t t t r I f f 1 ( I f f r f I t t f t f t t i i I 

AGTGTAGATACCACAGGAACGTTTAAAATCCAGAAGACCA 40 
GACTGCAAAGGGAAGGATACGATCCACGGCTCACAACTGA 80 
CC AG A TC T AC TTCCTAAACTCC AG AGCAGGGCGTTACG AG 120 
CTTGTCAACGAGGAGCTGTACAATGCATTTGAACAAGGGC 160 
AGGATTTCCCTTT 173 



WO 99/36537 PCT/US99/00I82 



92/U7 

drFATP partiai.protein 



10 20 30 40 
. i .i i -i .i i i I i i i t t i « i i i i i i i i i i .t i I i i i t i i i i i t 



SVDTTG7FK I QKTRLGREGYDPRLTTDQ I YFLNSRAGRYE 40 
LVNEELYNAFEOGQDFP 57' 



fio- 75 



WO 99/36537 „ 
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93/117 

ceFATPa coding only.DNA 



10 20 30 40 

_t i i i 1 i ■ i f 1 * i t i 1 t. i t i 1 t I- I I I I I I .« .1 t I t r 1 . | , ■ | 

ATGAAGCTGGAGGAGCTTGTGACAGTTATGCTTCTCACAG 40 
TGGCTGTCATTGCTCAGAATCTTCCGATTGGAGTAATATT 80 
GGCTGGAGTTCTTATTTTATACATCAC AGTGGTTCATGGA 120 
GATTTCATTTATAGAAGTT TATCTTACGTTGAATAGGGATT 160 
TAACAGGATTGGCTCTAATTATTGAAGTCAAAATCGACCT 200 

210 220 230 240 

■ i r I I t i t t I . t I I ( I i t I I 1 1 I 1 1 1 1 t 1 I till f lit, f 

ATGGTGGAGGTTGCATCAGAATAAAGGAATCCAJGAACTG 240 
TTTTTGGATATTGTGAAAAAGAATGCAAATAAGCCGGCGA 280 
TGATTGACATCGAGACGAATACAACAGAAACATACGCAGA 320 
GTTCAATGCACATTGTAATAGATATGCCAATTATTTCCAG 360 
GGTCTTGGCTATCGATCCGGAGACGTTGTCGCCTTGTACA 400 

410 420 430 440 

i f i i I i t t f I t i f x f i i t f f r r f r I i i i i I t t f r } f t t f f 

TGGAGAACTCGGTCGAGTTTGTGGCCGCGTGGATGGGACT 440 
CGC AAA AA TCGG AG TTG T A A'CGGCTTGG A TC A AC TCG A AT 480 
TTGAAAAGAGAGCAACTTGTTCATTGTATCACTGCGAGCA 520 
AGACAAAGGCGATTATCACAAGTGTAACACTTCAGAATAT 560 
TATGCTTGATGCTATCGATCAGAAGCTGTTTGATGTTGAG 600 
610 620 630 640 

1 i i t 1 i t t i t i i t i I i i i, i I i t i t 1 i i i t I i i , i i I mil 

GGAATTGAGGTTTACTCTGTCGGAGAGCCCAAGAAGAATT 640 
CTGGATTCAAGAATCTCAAGAAGAAGTTGGATGCTCAAAT 680 
TAC TACGG AAC C AAAGACCC TTG AC ATAGTAGATTTT AAA 720 
AGTATTCTTTGCTTCATCTATACAAGTGGTACTACTGGAA 760 
TGCCAAAAGCCGCTGTCATGAAGCACTTCAGATATTACTC 800 
810 820 830 840 

■ I f f 1 t f t t f \ 1 I i f I I f T I 1 I I I t f t t 1 f I T f f I f , t 1 f . 1 . 1 

GATTGCCGTTGGAGCCGCAAAATCATTCGGAATCCGCCCT 840 
TCTGATCGTATGTACGTCTCGATGCCAATTTATCACACTG 880 
CAGCTGGAATTCTTGGAGTTGGGCAAGCTCTGTTGGGTGG 920 
ATCATCGTGTGTCATTAGAAAAAAATTCTCGGCTAGCAAC 960 
TTTTGGAGGGATTGTGTAAAGTATGATTGTACAGTTTCAC 1000 

1010 1020 1030 1040 

i L 1 ,1 I I I t 1 I I I I I I till! 1 I 1 I I 1 t T L . t I . 1 I 1 1 1 t t 1 1 ■ 

AATACATTGGAGAGATTTGTCGGTACTTGTTGGC TCAGCC 1040 
AGTTGTGGAAGAGGAATCCAGGCATAGAATGAGATTGTTG 1080 
GTTGGAAACGGACTCCGTGCTGAAATCTGGCAACCATTTG 1 120 
TAGATCGATTCCGTGTCAGAATTGGAGAACTTTATGGTTC 1 160 
AACTGAAGGAACTTCATCTCTCGTGAACATTGACGGACAT 1200 



WO 99/36537 

PCT/US99/00182 



94/117 

ceFATPa coding only.DNA 

1210 1220 1230 1240 

tttti-itiftittit i t i i ) i i * i I t r t t ] t i t , ) i , « i ] 

GTCGGAGCTTGCGGATTCTTGCCAATATCCCCATTAACAA 1240 
AGAAAATGCATCCGGTTCGATTAATTAAGGTTGATGATGT 1280 
CACTGGAGAAGCAATCCGAACTTCCGATGGACTTTGCATT 1320 
GC A TG T AATC C AGG AG AG TC TGG AGC A A TGGTGTCG AC G A 1360 
TCAGAAAAAATAATCCATTATTGCAATTCGAGGGATATCT 1400 

1410 1420 1430 1440 

i i i i ! i,i i i I i i \ i 1 i i i > 1 i i i i 1 i i i t T i t i t I i i,i t 1 , . 
GAATAAGAAGGAAACGAATAAAAA6ATTATCAGAGATGTC 1440 
TTCGCAAAGGGAGATAGTTGCTTTTTGACTGGAGATCTTC 1480 
TTC ATTGGG A TCG TC T TGG TT A TGTATATTTCAAGGATCG 1520 
TACTGGAGATACTTTCCGTTGGAAGGGAGAGAATGT-GTCG 1560 
ACTACTGAAGTCGAGGCAATTC TTCATCCAATTACTGGAT 1600 

1610 1620 1630 1640 

r t i i I i i i i I i i i i I i t r t I i i i r I i t i i I t t i i 1 i i i r I 

TGTCTGATGCAACTGTTTATGGTGTAGAGGTTCCTCAAAG 1640 
AGAG GG AAG AG T TGG A A TGGCGTC AG TTG TTC GAGTTGTA 1680 
TC GC A TGAG G AAG ATG A AAC TCAATTTG TTC A TAG AG TTG 1720 
G AGC A AGAC T TGCC TC T TC GC TTACC AGC TACGCG AT TCC 1760 
TCAGTTTATGCGAATTTGTCAGGATGTTGAGAAAACAGGT 1800 

1810 1820 1830 1840 

t f t f f f i t.t f i ,t i i 1 i f r f t f t i t I r i f t I t f i t ] t t r r f 

ACATTCAAACTTGTGAAGACGAATCTACAACGATTAGGTA 1840 
TCATGGATGCTCCTTCAGATTCAATTTACATCTACAATTC 1880 
TGAAAATCGCAATTTTGTGCCGTTCGACAATGATTTGAGG 1920 
TGCAAGG TCTCACTGGGA AG TTATCC AT TTTA A 1953 



Fu. 76 B 



WO 99/36537 



PCT7US99/00182 



95/117 

ceFATPa coding onl y.p rotein 



10 20 30 40 

i » i i 1 i i i i I t i i i I » i i « 1 f i « . I i « t i I i i i i I i i i « t 

MKLEELVTVMLLTVAV I AGNLPI GV ILAGVL ILYI TVVHG 40 
OF I YRSYLTLNRDLTGLAL I IEVK I DLWWRLHQNKGIHEL 80 
FLD I VKKNPNKPAM I D I ETNTTETYAEFNAHCNRYANYFG 120 
GLGYRSGDVVALYMENSVEFVAAWMGLAK I GVVTAWI NSN 160 
LKREQLVHC I TASKTKA I I TSVTLGN I MLDA IDQKi-FOVE 200 
210 220 230 240 

G IEVYS VGEPKKNSGFKNLKKKLDAQ I TTEPKTLD I VDFK 240 
5ILCFI YTSGTTGMPKAAVMKHFRYYS I AVGAAKSFG IRP 280 
SDRMYVSMP I YHTAAG I LGVGQALLGGSSCV 1 RKKFSASN 320 
FWRDCVKYDCTVSGY I GE I CRYLLAQP VVEEESRHRMRLL 360 
VGNGLRAE I WGPFVDRFRVR I GELYGSTEGTSSLVNI DGH 400 

410 420 430 440 

I 1 I I f f f 1 1 I I I T 1 I I 1 I 1 I 1 T I 1 f I f 1 I I J 1 1 I I f I T I t I 

VGACGFLP I SPL TK KMHP VRL IK VDDVTGE A I RTSDGLC I 440 
ACNPGESGAMVST IRKNNPLLQFEGYLNKKETNKK I IRDV 480 
FAKGDSCFLTGDLLHWORLGYVYFKDRTGDTFRWKGENVS 520 
TTEVEA I LHP I TGLSDATVYGVEVPGREGRVGMASVVR VV 560 
SHEEDETGFVHRVGARLASSLTSYA I PGFMR f CQOVEKTG 600 

610 620 630 640 

,i i t i I i i \ i 1 t i i i I m r i I i-i ) , i I t f t 1. 1 ; i >■ i I i t i i ! 

TFKLVKTNLQRLG I MDAPSDS I Y I YNSENRNFVPFDNDLR 640 
CKVSLGSYPF. 651 



WO 99/36537 
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96/117 

ceFATFb coding only.DNA 



10 20 30 40 

i i t i 1 i i i i f t i i i t i ? t i ( i i i i I i i i i f i i \ i i i ,i i i, f 

A TG AGGGA AATGC CGG AC AGTCCCAAGTTTGCGT TAG TC A 40 
CGTTTGTTGTGTATGCAGTGGTTTTGTACAATGTCAACAG 80 
CGTTTTCTGGAAATTTGTATTCATCGGATATGTTGTATTT 120 
AGGCTGCTTCGCACTGATTTTGGAAGAAGAGCACTTGCCA 160 
CGTTACCTAGAGATTTTGCGGGAC TGAAGCTCTTAATATC 200 

210 220 230 240 

< * ■ t i ■ t i i i i i i ■ i i i i i f i i i i . i i i i * I i i i > i i . i i i 

GGTTAAGTCGACAATTCGTGGCTT^GTTCAAGAAA'GATCGC 240 
CCAATTCATGAAATCTTTTTGAATCAGGTGAAACAGCATC 280 
CAAACAAAGTGGCGATTATTGAAATTGAAAGTGGTAGGCA 320 
GTTGACGTATCAAGAATTGAATGCGTTAGCTAATCAGTAT 360 
GCTAACCTTTACGTGAGTGAAGGTTACAAAA'TGGGCGACG 400 
4 10 420 430 440 

> 1 I 1 J t I I I 1 1 I t I . f II it! ) I t 1 1 1 1 1 ,1 I l i 1 1 I I I 1 I M I II 

TTGTCGCTTTGTTTATGGAAAATAGCATCGACTTCrTTGC 440 
AATTTGGCTGGGACTTTCCAAGAT TGGAGTCGTGTCGGCG 480 
TTCATCAACTCAAACTTGAAGTTGGAGCCATTGGCACATT 520 
CGATTAATGTTTCGAAGTGCAAATCATGCATTACCAATAt 560 
CAATCTGTTGCCGATGTTCAAAGCCGCTCGTGAAAAGAAT 600 

610 620 630 640 

■ l t I I l t l I I l « l l I l I I t 1 t I I I I T I I I I I 1 t I t 1 l l I 

CTGATCAGTGACGAGATCCACGTGTTTCTGGCTGGAACTC 640 
AGG T TGA TGG ACG TC A T AG A AGTCTTCAGCAAG A TCTCC A 680 
TCTTTTCTCTGAGGATGAACCTCCAGTTATAGACGGACTC 720 
AATTTTAGAAGCGTTCTGTGTTATATTTACACTTCCGGTA 760 
CTACCGGAAATCCAAAGCCAGCCGTCAT TAAACACTTCCG 800 

810 820 830 840 

* * * * ' * » > ' 1 » * * ■ * ' * ■ * I * * ' ■ * 1 * * * I ■ * * ' * * * * ■ ! 

TTACTTCTGGATTGCGATGGGAGCAGGAAAAGCATTTGGA 840 
ATTAATAAGTCAGACGTTGTGTACATTACGATGC'CA'ATGT 880 
ATCACTCTGCCGCCGGTATCATGGGTATTGGATCATTAAT 920 
TGCATTCGGGTCGACCGC TGTTATTAGGAAAAAGTTTTCG 960 
GCAAGCAACTTCTGGAAAGATTGCGTCAAGTACAACGTCA 1000 

1010 1020 1030 1040 

i i I i 1 i t \ \ 1,1 i .1 t i i l I t t, i i I t i ( i I t i t i I i i t i f 

CAGCGACACAGTACATTGGAGAAATCTGCAGGTATCTTCT 1040 
GGCAGCGAATCCATGTCCTGAAGAGAAACAACACAACGTG 1080 
CGATTGATGTGGGGAAATGGTTTGAGAGGACAAATTTGGA 1 !20 
AAGAGTTTGTAGGAAGATTTGGAATTAAGAAAATTGGAGA 1 160 
GTTGTACGGCTCAACAGAAGGAAACTCCAAfATTGTTAAC 1200 
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97/117 

ceFATPb coding only.DNA 

1210 1220 1230 1240 

.... i . ■ < < t . « * t i i t i i I i i i t i i r ii It i i i i i i t i \ 

GTGGATAACCATGTTGGAGCTTGTGGATTCATGCCAATTT 1240 
ATCCCCATATTGGATCCCTCTACCCAGTTCGACTTATTAA 1280 
GGTTGATAGAGCCACTGGAGAGC TTGAACGTGATAAGAAC 1320 
GGACTCTGTGTGCCGTGTGTGCCTGGTGAAACTGGGGAAA 1360 
TGGTTGGCGTTATCAAGGAGAAAGATATTCTTCTAAAGTT 1 400 

1410 1420 1430 1440 

i ■ > ■ I ' ■ * > I i t ■ « I t « < t f i i i i 1 i i i t I i i i t F i i i i 1 

CGAAGGATATGTCAGCGAAGGGGATACTGCAAAGAAAATC 1 440 
TACAGAGATGTGTTCAAGCATGGAGATAAGGTGTTTGCAA 1480 
GTGGAGATATTCTTCATTGGGATGATCTTGGATACTTGTA 1 520 
CTTTGTGGACCGTTGTGGAGACACTTTCCGTTGGAAAGGG 1560 
GAGAACGTGTCAACTACTGAAGTTGAGGGAATTCTTCAGC 1 600 

1610 1620 1630 1640 

■ > ■ ■ i i i ■ ■ f i i t i i i « i « t i i t i t t i t i I i i t t I titil — 

CTGTGATGGATGTGGAAGATGCAACTGTTTATGGAGTCAC 1640 
TGTCGGTAAAATGGAGGGGCGTGCCGGAATGGCTGGTATT 1680 
GTCGTCAAGGATGGAACGGATGTTGAGAAATTCATC.GCCG 1 720 
ATATTACTTCTCGACTGACCGAAAATCTGGCGTCTTACGC 1 760 
AATCCCTGTTTTCATTCGGCTGTGCAAGGAAGTTGATCGA 1800 

1810 1820 1830 1840 

rti it iittl irtil 1 T 1 I I I I 1 I I tlltl 

ACCGGAACCTTCAAACTCAAGAAGACTGATCTTCAAAAAC 1840 
AAGGTTACGACCTGGTTGCTTGTAAAGGAGACCCAATTTA 1880 
CTACTGGTCAGCTGCAGAAAAATCCTACAAACCACTGACT 1920 
GACAAAATGCAACAGGATATTGACACTGGTGTTTATGA.TC 1 960 
GCATTTAA 1968 
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98/117 

ceFATPb coding only.protein 



10 20 30 40 

* * * « ■ < ■ * i * * * » 1 * * ' * i * * ■ ' 1 * ■ ■ * 1 * * « ■ 1 * ■ * ' J 
MREMPOSPKFALVTF VVYAV VL YNVNSVFWKFVF I GYVVF 40 
RLLRTDFGRRALATLPRDFAGLKLL I SVKST IRGLFKKDR 80 
P I HE I FLNQVKGHPNKVA I I E I ESGRGLTYQELNALANOY 120 
ANLYVSEGYKMGD VVALFMENS IDFFA IWLGLSK I GVVSA 1 60 
F I NSNLKLEPLAHS I NVSKCKSC I TN I NLLPMFK AAREKN 200 

210 220' 230 240 

' « ' ■ I t » ■ ■ I ■ t . « I . i ■ . I i i i t 1 i r it \ I i f i I I 

L I SDE IHVFLAGTQVDGRHRSLQQDLHLFSEDEPPV I DGL 240 
NFRSVLCY f YTSGTTGNPKPAV I KHFRYFWI AMGAGKAFG 280 
I NKSDVV Y I TMPMYHSAAG IMG I GSL I AFGSTAV I RKKFS 320 
ASNFWKDCVKYNVTATQY I GE I CRYLLAANPCPEEKGHNV 360 
RLMWGNGLRGQ I WKEFVGRFG I KK I GEL YGSTEGNSN I VN 400 

410 420 430 440 

t 1 I 1 I I I I 1 1 1 I 1 I I 1 I I I f 1 I I T 1 t 1 I I I t I t T T t 1 t | f 

VDNHVGACGFMP I YPH I GSLYPVRL I K VDRATGELERDKN 4.40 
GLCVPCVPGETGEMVGV I KEKD I LLKFEG YVSEGDTAKK I 480 
YRDVFKHGDKVFASGD I LHWDOLGYLYFVDRCGDTFRWKG 520 
ENVSTTEVEG I LQPVMDVEDATVYGVTVGKMEGRAGMAG I 560 
VVKDGTDVEKF I AO t TSRLTENLASYA IP VF I RLCKEVDR 600 

610 620 630 640 

* ■ ' ' 1 ■ • * * t ■ » > ■ i i « i ■ I itit t i t t i I iiii t . . . . I 

TGTFKLKKTDLGKQGYDLVACKGDP I Y YWSAAEKSYKPLT 640 
DKMQQD I DTGVYOR I. 656 
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99/117 

chFATF coding only.PNA 



10 20 30 40 

* ^ ■ ■ ■ * * ■ i < i t i i i t i i t t i I i t . t i . . « | | 



ATGGCGTGTATGCATCAGGCTCA'GCTATACAATGA TC TAG 40 
AGGAATTGCTAACTGGTCCATCAGTACCCATCGTTGCTGG 80 
AGCTGCTGGAGCTGCAGCTCTCACTGCGTACATTAACGCC .120 
AAATACCACATAGCCCATGATCTCAAGACCCTGGGTGGTG 160 
GATTGACACAATCGTCCGAAGCGATTGATTTCATAAACCG 200 
210 220 230 240 

1 « ■ ■ ' 1 ■ ■ ■ «-1 1 1 1 ' * 1 ■ ■ * 1 1 1 ' ' 1 i i i i F t i i « I .... | 



CCGCGTCGCACAAAAGCGCGTCCTCACGCACCACATCTTC-240 
CAGGAGCAGGTCCAAAAACAATCAAATCATCCCTTTCTTA 280 
TCTTTGAGGGCAAGACATGGTCTTACAAGGAGTTCTCTGA 320 
GGCATACACGAGGGTCGCGAACTGGCTGATTGATGAGCTG 360 
GACGTACAAGTAGGGGAGATGGTGGCAATTGATGGCGGAA 400 

4 tO 420 430. 440 

■ » « ■ * ■ « * ■ I ■ * * ■ * * * * ■ l ' 1 ' ■ 1 - * ■ « ' I » ■ ■ ■ ' ■ ' t * I 



ATAGTGCAGAGCACCTGATGCTTTGGCTTGCACTTGATGC 440 
AATCGGTGCGGCTACGAGTTTTTTGAACTGGAACCTGACA 480 
GGGGCAGGGTTAATTCATTGCATAAAGCTATGCGAATGTC 520 
GATTCGTTATCGCAGACATCGATATTAAAGCGAACATTGA 560 
ACCGTGCCGTGGCGAACTGGAGGAGACGGGCATCAACATT 600 
610 620 630 640 . 

' l ' I t ■ ' ' ' ' ' t t > ) t I I. t_l -t i 1 I I t f I t 1 t t I r 1 



CACTACTATGACCCATCCTTCATCTCATCGCTACCGAATA 640 
ACACGCCAATTCCCGACAGCCGCACTGAGAACATTGAATT 680 
AGATTCAGTACGAGGACTGATATACACATC TGGAACCACT 720 
GGTCTACCTAAAGGCGTGTTTATAAGCACTGGCCGCGAGC 760 
TTAGGACTGACTGGTCGATTTCAAAGTATC TAAATCTCAA 800 

810 820 830 840 

' « ' * * t t f t I i t i ) It t t i I t t i i I i f f r I i i i i i i , < t I 



GCCCACGGATCGAATGTATACATGTATGCCGCTCTACCAT 840 
GCCGCTGCACACAGCCTCTGTACAGCATCAGTTATTCATG 880 
GTGGAGGTACCGTGGTATTGAGCAGGAAATTCTCACACAA 920 
GAAGTTCTGGCCTGAAGTTGTGGCTTCGGAAGCAAATATC 960 
ATTCAGTACGTTGGTGAATTAGGTCGATATCTCCTGAATG 1000 
1010 1020 1030 1040 

' ' * ' 1 ' ■ ■ 1 * ■ 1 ' 1 1 * ' 1 ' J 1 ■ ' ' 1 * 1 « ■ f ' 1 » 1 1 * ' » ■ t 



GTCCAAAGAGTCCTTACGACAGGGCCCATAAAGTCCAGAT 1040 
GGCG TGGGGC A A TGGC A TGCGTCC AG ACGTGTGGG A AGCG 1080 
TTTCGTGAACGCTTCAACATACCAATTATTCATGAGCTCT 1 120 
ATGCCGCAACCGATGGGCTCGGGTCAATGACCAArCGTAA 1 160 
CGCGGGCCCTTTTACAGCAAACTGTATTGCGCTGCGAGGG 1200 



WO 99/36537 



100/117 

chFATP coding only.DNA 

1210 1220 1230 1240 

ti<i1 tttt\itti\tttt)ltir \ rill) ttitltttil 

CTGATCTGGCACTGGAAATTTCGAAATCA6GAAGT6CTGG 1240 
TCAAGATGGATCTCGATACTGATGAGATCATGAGAGATCG 1280 
CAATGGGTTTGCGATACGATGCGCTGTCAA7GAACCTGGA 1320 
C AGA TGC TTT TTCGGCTG AC ACCCGAA AC TCTGGCTGGTG 1360 
CACCAAGCTACTACAACAACGAAACGGCCACACAGAGCAG 1400 

1410 1420 1430 1440 

■ » i t I i i i i I i i i i I i i t t I » r i i t 'i i i i f - i i « i I i i i i l 

GCGGATTACAGATGTGTTTCAAAAGGGTGACCTGTGGTTC 1440 

AAGTCCGGTGACATGCTACGGCAAGAeGCCGAAGGCCGCG 1480 

TCTACTTTGTCGATCGACTAGGCGATACGTTCCGCTGGAA 1520 

ATCCGAAAACGTTTCTACCAATGAAGTCGCGGACGTGATG 1560 

GGC A C AT TTC C TC AGA T TGC TG A A ACG A ATGTATACGGTG 1600 

1610 1620 1630 1640 

i t i i I,, i i t i 1 i i i i I i.i i 1. 1 i i i i I l il l I t i i i 1 i i i i 1 _ 
TCCTTGTGCCGGGTAACGATGGTCGAGTGeGCAGCCfCAA 1640 
TTGrCATGGCAGACGGCGTGACAGAGTCGACATTCGCTTC 1680 
GCTGCCCTTGCAAAGCACGCCCGAGATCGGTTACCGGGTT 1720 
ATGCTGTACCACTGTTTCTGAGGGTAACfCCAtSCACf TGA 1760 
ATATACGGGCACA7TAAAGATTCAGAAAGGACGCCTCAAG 1800 
1810 1820 1830 1840 

I = : t 1 i,i i t \ i : f t J > i r f'l f t 1 i I M I, t 1 r I t . f t i t r 1 

C AGG A AGG TA T AG AC CC AG AT AAG A TTTCCGGCGAAG AT A 1 840 
AGTTATACTGGCTGCCGCCTGGTAGCGATATATATTTACC 1880 
ATTTGGAAAGATGGAGTGGCAGGGAATTGTAGATAAGCGT 1920 
ATACGGCTGTGA 1932 



WO 99/36537 



PCT7US99/00182 



101/117 

chFATP coding onty.protein 



10 20 30 40 

■ «» ■ 1. 1 i i i i i i «■ i i i i i i i . i i i i t i 1. 1 l i i i i t < .i 1 1 i- 
MACMHOAQLYNDLEELLTGPSVP I VAGAAGAAALTAY ENA 40 
KYHIAHDLKTLGGGLTOSSEAIDFINRRVAOKRVLTHHIF 80 
GEQVGKGSNHPFLIFEGKTWSYKEFSEAYTRVANWLIDEl 120 
DVQVGEMVA IDGGNSAEHLMLWLALDAI GAATSFLNWNLT 160 
GAGL IHC 1 KLCECRFV IAD I D I KAN I EPCRGELEETG I N I 200 

210 220 230 240 

* ■ * * 1 * * * * * » * * * 1 ' * * * * ' * * * * 1 * ■ ■ ' I * * » * 1 ■ * * * * 
HYYOPSFI SSLPNNTP IPDSRTEN IELDSVRGL I YTSGTT 240 
GLPKGVF I STGRELRTOWS I'SK YLNLKPTDRMYTCMPLYH 280 
AAAHSLCTASV IHGGGTVVLSRKFSHKKFWPEV VASEAN I 320 
I QYVGELGRYLLNGPKSPYORAHKVOMAWGNGMRPDVWEA 360 
FRERFNIP I IHEUYAATDGLGSMTNRNAGPFTANCIALRG 400 

410 420 430 440 

* ' ' ' * i i i t l . . * . i i . . i I i i t t i i ■ t i I t i -i i i it « . . j 

L IWHWKFRNQEVLVKMDLDTDE I MRDRNGFA1 RCAVNEPG 440 
QMLFRLTPETLA6APSYYNNET ATQSRR I TOVFQKGOLWF 480 
KSGDMLRGDAEGRVYFVDRLGDTFRWKSENVSTNEVADVM, 520 
GTFPQ I AETNVYGVLVPGNDGRVRSLNCHGRRRDRVO IRF 560 
AALAKHARDRLPGYAVPLFLRVTPALEYTGTLKIGKGRLK 600 

610 620 630 640 ■ 

i t i t f i i i i I i i t i I i , i i i I i r i i f i i i i 1 i t i i I . m i i I 

QEG1DP0K I SGEDKLYWLPPGSD I YLPFGKMEWQG I VDKR 640 
IRL 643 



WO 99/36537 
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102/117 

aspergiilus partial.DNA 



10 20 30 40 

J.Lr t f 1 r i ,i t I i , t i I i 1 i i 1 i l i i I ti i t f t 1 t , 1., 



CTTTACCATTCATCAGCT TC AT TCTGCATTTTTAGCTTG A 40 
CGGCAGCCGGGTCTACGCT.GATCATCGGCCGCAAGTTCTC 80 
CGCG AGAAAC T TC AT A A AGG A AGO GCGCG AG AACG AC GCC 120 
ACGGTCATCCAGTACGTGGGTGAGACCTTGCGATATCTGC 160 
TCGCCACCCCCGGTGAAACCGATCCAGTTACTGGCGAAGA 200 

210 220 230 240 ■' " 

t T I 1 1 I 1 1 I I t I 1 1 I 1 1 I t I 1 ,T I I I Vl I T I 1 1 I 1 1 \ I 1 1 I 



CCTGGACAAAAAGCACAATATTCGAGCAGTATACGGCAAC 240 
GGGCTACGGCCGGATATCTGGAACCGCTTCAAGGAGCGCT 280 
TCAACGTGCCGACGGTTGCCGAATTTTATGCTGCAACCGA 320 
GAGCCCAGGCGGAACATGGAACTATTCAACAAATGACTTC 360 
ACTGCCGGAGCCATTGGGCACACTGGCGTGGTTAGTGGAT 400 

410 420 430 440 

t t * i * * » ' > i » * ■ « i t i t i I t i > t r i t * i \ I i t r i i i i t t 



GGCTTCTTGGACGCGGCCTTACTATTGTCGAGGTGGACCA 440 
GGAATCACAGGAACCATGGCGCGATCCCCAAAC.GGGGTTC 480 
TGCAAGCCGGTCCCGCGAGGCGAAGCAGGCGAGCTCCTGT 520 
ATGCCATTGATCCGGCCGACCCGGGCGAGACGTTCCAGGG 560 
CTACTACCGCAACTCCTTTAGAGtACACTGGCGGCCG 597 



WO 99/36537 
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103/117 

aspergillu s partia l. protein 



10 20 30 40 

■ t » * i i i i i i i i i * i i t i i i t t t i i i i 1. 1 i i . i ■ 1 1 i i 



LYHSSASFC JFSLTAAGSTL I I GRKFSARNF I KEAREMOA 40 
TVI QYVGETLRYLLATPGETDPVTGEDLDKKHN IRAVYGN 80 
GLRPD I WNRFKERFNVPTVAEFYAATESPGGTWNYSTNDF 120 
TAGA1 GHTGVLSGWLLGRGLT1 VEVDQESQEPWRDPQTGF 160 
CKPVPRGEAGELLYA IDPADPGETFQGYYRNSFRAHWRP 199 



WO 99/36537 
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104/117 



nngFATP partiaLPNA 



10 



20 30 40 

1 ' * * * 1 * > * * 1 1 > * * i * * ] ' * * * * ' i 



GCAAAGGCCGACGCGTGGCTGCGGACGGGTAACGTGATCA 40 
GGGCGGACAACGAAGGGCGACTCTTCTTCCACGACCGGAT 80 
CGGAGACACGTTCCGATGGAAGGGAGAGACNGTCAGCACA 120 
C AAGAGGTC AG TTTG G TGC TCGGACGACACG AC TC A A TC A 160 
AGGAGGCCAACGTGTACGGCGTGACGGTGCCGAACCACGA 200 

210 220 230 240 

i i „i t 1 I r -i i ! t i i \ \ i i i i \ \ i i t I 1 i i 1. 1 i i i i I i i t i I 

CGGGCGGGCCGGCTGCGCTGCGCTCACGCTATCAGACGCT 240 
CTGGCGACTGAAAAGAAGCTGGGCGATGAGCTGCTAAAGG 280 
GATTGGCTACTCACTCGTCGACTTCGCTTCCCAAGTTTGC 320 
GGTGCCGCAGTTCCTACGGGTGGTGCGCGGCGAGATGCAG 360 
TCAACGGGCACCAACAAGCAACAGAAGCACGACCTGAGGG 400 



TGCAGGGTGTAGAGCCGGGCAAGGTGGGCGTAGACGAGGT 440 
GTACTGGTTGCGGGGAGGGACATATGTACCATTCGGAACA 480 
GAGGATTGGGATGGGTTGAAGAAGGGTCTTGTGAAGTTGT 520 
GA 522 





WO 99/36537 



PCT/US99/00182 



105/117 

mgFATP partial.protein 



10 20 30 40 

i . j, i i 1 i r i i t i i t i 1 1 i i i i I i i i r I i t i t I i i i t T i t i t I 

AKADAWLRTGNV I RADNEGRLFFHDR I GDTFRWKGETVST 40 
QEVSLVLGRHDSIKEANVYGVTVPNHDGRAGCAALTLSDA 80 
LATEKKLGDELLKGLATHSSTSLPKFAVPQFLRVVRGEMG 120 
STGTNKQQKHOLRVQGVEPGKVGVDEVYWLRGGTYVPFGT 160 
EDWDGLKKGLVKL 173 



Ffa. K 
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106/U7 

scFATP coding onty.DNA 



10 20 30 40 

* ■ ■ * 1 * * ■ ■ t * 1 * * 1 ■ 1 1 ■ * 1 ■ 1 1 1 1 * * ■ t 1 1 r 1 1 * ' ■ ■ 1 

ATGTCTCCCATACAGGTTGTTGTCTTTGCCTTGTCAAGGA 40 
riTTCCTGCTATTATTCAGACTrATCAAGCTAATTATAAC 80 
CCCTATCCAGAAATCACTGGGTTATCTATTTGGTAATTAT 120 
TTTGATGAA7TAGACCGTAAATA7AGATACAAGGAGGAT7 160 
GGTATATTATTCCTTACTTTTTGAAAAGCGTGTTTTGTTA 200 

210 220 230 240 

.i«tt «itif ttiif tiiti .^. r f 

TATCATTGATGTGAGAAGACATAGGTTTCAAAACTGGTAC 240 
TrATTTATTAAACAGGTCCAACAAAATGGTGACCATTTAG 280 
CGATTAGTTACACCCGTCCCATGGCCGAAAAGGGAGAATT 320 
TCAACTCGAAACCTTTACGTATATTGAAACTTATAACATA 360 
G TG T TG AG ATTGTCTC A TAT TTTGC A TTTTGATTATAACG 400 

410 420 430 440 

■ ■ ■ » * . ... I .... i ... > i . . . , f . . , , ) , . , . f .... I 

TTCAGGCCGGTGACTACGTGGCAATCGATTGTACTAATAA 440 
ACCTCTTTTCGTATTTTTATGGCTTTCTTTGTGGAACATT 480 
GGGGCTATTCCAGCTTTTTTAAACTATAATACTAAAGGCA 520 
- CTCCGCTGGTTCACTCCCTAAAGATTTCCAATATTACGC A 560 
GG TA T T TA TTG AC CC T GATGC CAGT AATCCG A TC AG A GA A 600 

610 620 630 640 

» ' * ■ i ■ ■ « ' ) ■ ' "» * 1 1 » 1 ■ i ' 1 * 1 * ■ ■ f * 1 ■ ■ ■ ■ 1 * « ' ■ 1 
TCGGAAGAAGAAATCAAAAACGCACTTCCTGATGTTAAAT 640 
TAAACTATCTTGAAGAACAAGACTTAATGCAT6AACTTTT 680 
AAATTCGCAATCACCGGAATTCT7ACAACAAGACAACGTT 720 
AGGACACCACTAGGCTTGACCGATTTTAAACCCTCTATGT 760 
TAATTTATACATCTGGAACC ACTGGTTTGCCTAAATCCGC 800 

810 820 830 840 

i I i i I t i i ,1 I t i i i I t r i i 1 i i i i I t i i i I i i t i I t i t i . t 

TA TTA TGTC TTGGAGA AAA TCCTCCGTAGGTTGTCA AG TT 840 
TTTGGTCATGTTTTACATATGACTAATGAAAGCACTGTGT 880 
TCACAGCCATGCCATTGTTCCATTCAACTGCTGCCTTATT 920 
AGGTGCGTGCGCCA7TCTATCTCACGGTGGT7GCCTTGCG 960 
TT A TCGC A T AAA T TT TC TGC C AG T AC A TTTTGG A AGC A AG tOOO 

1010 1020 1030 1040 
. t i i i i i i « i t ■ i t i t i i i i I i t i t f t i t i I i i i i t i i i 1. 1 

TTTATTTAACAGGAGCCACGCACATCCAATATGTCGGAGA 1040 
AG TC TG TAGA TACC TG TT AC ATACGCC A A TTTCTAAGT AT 1080 
GAAAAGATGCATAAGGTGAAGGTTGCTTATGGTAACGGGC 1 120 
TGAGACCTGACATCTGGCAGGACTTCAGGAAGAGGTTCAA I 160 
CATAGAAGTTATTGGTGAATTCTATGCCGCAACTGAAGCT 1200 



fia. %A 
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107/117 

scFATP coding oniy.PNA 

1210 1220 1230 1240 

■ » * * i i i i t i i t i . i | . t t i i i t i i i i t i i i i ■ ■ i t i > t i 

CC TT T TGCTAC A AC TACCTTCC AG AAAGGTG AC TTTGGAA 1240 
TTGGCGCATGTAGGAACTATGGTACTATAATTCAATGGTT 1280 
TTTGTCATTCCAACAAACATTGGTAAGGATGGACCCAAAT 1320 
G ACG A TTCCGTT AT A TATAG A A ATTCCAAGGGTTTCTGCG 1360 
AAGTGGCCCCTGTTGGCGAACCAGGAGAAATGTTAATGAG 1400 

1410 1420 1430 1440 

■ « ■ * J ■ * ■ » * ■ * * * * ' ■ ■ » 1 ■ ■ * * * » « 1 * I * ' 1 * i ■ * ' ■ l 

AATCTTTTTCCCTAAAAAACCAGAAACATCTTTTCAAGGT 1440 
TATCTTGGTAATGCCAAGGAAACAAAGTCCAAAGTTGTGA 1 480 
GGGATGTCTTCAGACGTGGCGATGCTTGGTATAGATGTGG 1520 
AGAT TTATT AAA AG CGGACGA AT A TGGATTATGGTATTTC 1560 
C TTGA TAG A ATGG GTGA T AC TTTC AG A TGG A AATCTG AAA 1600 

1610 1620 1630 1640 
* * * * * 1 * * 1 i i i t i i i * t i I i i i i i i i t i f i i t i i i > t i l 

ATGTTTCCACTACTGAAGTAGAAGATCAGTTGACGGCCAG 1640 
TAACAAAGAACAATATGCACAAGTTCTAGTTGTTGGTATT 1680 
AAAGTACCTAAATATGAAGGTAGAGCTGGTTTTGCAGTTA 1720 
TTAAACTAACTGACAACTCTCTTGACATCACTGCAAAGAC 1760 
CAAATTATTAAATGATTCCTTGAGCCGGTTAAATCTACCG 1800 

1810 1820 1830 1840 

' ' ' 'I * * t 1 ) 1 * I ♦ 1 l < I I I t ! t t ) t ! 1 t ] I f f , I , ft , I 

TCTTATGCTATGCCCCTATTTGTTAAATTTGTTGATGAAA 1840 
TTAAAATGACAGATAACCTCATAAAATTTTGA 1872 
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108/117 



scFATP coding only.protein 



10 20 

■t.t ,t f I t, t t i.l t i„ t t f, t i ,f, f I r i i f I i, 



30 40 

i f f f i ■ i t I t t i r I 



MSP J QVVVFALSR IFLLLFRL IKL I I TP I GKSLGYLFGNY 40 
FDELDRKYRYKEDWY I IPYFLKSVFCYI 1 OVRRHRFQNWY 80 
LF I KQ VQQNGDHLA I SYTRPMAEKGEFOLETFTY I ETYN I 1 20 
VLRLSH I LHFDYNVQAGD YVA I DCTNKPLFVFLWLSLWN I 160 
GA I PAFLNYNTKGTPLVHSLK 1 SN I TQVF I DPDASNP IRE 200 
210 220 230 .240 



SEES I KNALPDVKLNYLEEQDLMHELLNSQSPEFLGQDNV 240 
RTPLGLTDFKPSMLI YTSGTTGLPKSA I MSWRKSSVGCQV 280 
FGHVLHMTNESTVFTAMFLFHSTAALLGACA I LSHGGCLA 320 
LSHKFSASTFWKGVYLTGATH I QY VGEVCRYLLHTP I SKY 360 
EKMHK VKVAYGNGLRPO I WQOFRKRFN I EV I GEFY AATEA 400 
410 420 430 440 



J 1 ■■ T 1 1 1 1 T t 1 T M I I I I t I I I I I 1 f 1 t i I f 1 1 f 1 f I 1 1 J . i 



PFATTTFQKGDFGIGACRNYGTI I QWFLSFQQTLVRMDPN 440 
DOSV I YRNSKGFCEVAP VGEPGEMLMR I FFPKKPETSFQG 480 
YLGNAKETKSKVVRDVFRRGDAWYRCGDLLKADEYGLWYF 520 
LORMGDTFRWKSENVSTTEVEDGLTASNKEGYAQVLVVGI 560 
K VPK YEGRAGFAV I KLTDNSLD I TAKTKLLNDSLSRLNLP 600 
610 620 630 640 
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109/117 

rotFATP coding only,DNA 



10 20 30 40 

■ f t f t f f f 1 I I | I t f I T f T f f t t f I 1 f f t f 1 I f t 1 I I | \ J | 

GTGTCCGATTACTACGGCGGCGCAGACAGAACGGTCAGGC 40 
TGATCGACCTGGCAACTCGGATGCCGCGAGTGTTGGCGGA "80 
CACGCCGGTGATTGTGCGTGGGGCAATGACCGGGCTGCTG 120 
GCCCGGCCGAATTCCAAGGCGTCGATCGGCACGG7GTTCC 160 
AGGACCGGGCCGCTCGCTACGGTGAGCGAGTGTTCCTGAA 200 
210 22D 230 240 

* 1 * 1 * i t i i 1 .ii.l i i t i 1 t i t i 1 t t i i I t i i i \ , , t , I 

ATTCGGCGATCAGCAGCTGACCTACCGCGACGCTAAGGCC 240 
ACCGCCAACCGGTACGCCGCGGTGTTGGCCGCCCGCGGCG 280 
TCGGCCCCGGCGACGTCGTTGGCATCATGTTGCGTAAC'TC 320 
ACCCAGCACAGTCTTGGCGATGCTGGCCACGGTCAAGTGC 360 
GGCGCTAfCGCCGGCATGCTCAACTACCACCAGCGCGGCG 400 
410 420 . 430 440 

■ i ■ i I i t i i i t t t i t I i i i i f it r i ~ I t t t t t t i . ♦ t 
AGGTGTTGGCGCACAGCCTGGGfCTGCTGGACGGGAAGGTV 440 
AC7GATCGCAGAGTCCGACTTGGTCAGCGCCGTCGCCGAA 480 
TGCGGCGCCTCGCGCGGCCGGGTAGCGGGCGACGTGCTGA 520 
CCGTCGAGGACGTGGAGCGATTCGCCACAACGGCGCCCGC 560 
CACCAACCCGGCGTCGGCGTCGGCGGTGCAAGCCAAAGAC 600 

610 620 630 640 

■ i i ■ i i i i . I i i i n i i i i I - i i i i i i i i i I i t t i i * . ■. i 

ACCGCGTTCTACATCTTCACCTCGGGCACCACCGGATTTC 640 
CCAAGGCCAGTGTCATGACGCATCATCGGTGGCTGCGGGC 680 
GCTGGCCGTCTTCGGAGGGATGGGGCTGCGGCTGAAGGGT 720 
TC CG AC ACGC TC T AC A GC TGCCTGCCGCTGTACC A CAAC A 760 
ACGCGTTAACGGTCGCGGTGTCGTCGGTGATCAATTCTGG 800 
810 820 830 840 

■ * ' ' ■ * * 1 ■ i ■ i I t < t ■ > * * i t \ ,.i t t T , , . , t , , , , f 

GGCGACCCTGGCGCTGGGTAAGTCGTTTTCGGCGTCGCGG 840 
TTCTGGGATGAGGTGATTGCCAACCGGGCGACGGCGTTCG 880 
TCTACATCGGCGAAATCTGCCGTTATCTGCTCAACCAGCC 920 
GGCCAAGCCGACCGACCGTGCCCACCAGGTGCGGGTGATC 960 
TGCGGTAACGGGCTGCGGCCGGAGATCTGGGATGAGTTCA 1000 

1010 1020 1030 1040 
* ' * 1 1 ■ * ■ ' I i t t t i » » i > I * * i t i t i i t I i i t t i i , . . i 

CCACCCGCTTCGGGGTCGCGCGGGTGTGCGAGTTCTACGC 1040 
CGCCAGCGAAGGCAACTCGGCCTTTATCAACATCTTCAAC 1080 
GTGCCCAGGACCGCCGGGGTATCGCCGATGCCGCTTGCCT 1 120 
TTGTGGAATACGACCTGGACACCGGCGATCCGCTGCGGGA 1 160 
TGCGAGCGGGCG AG TGCGTCGGGTACCCGACGGTG A ACCC 1200 
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110/117 

mtFATP coding only.DNA 

1210 1220 1230 1240 

. * « t -t I » ■ 1 I I I F I I I I I I t I I F I F I I I I f I I I I f I I t J 

GGCCTGTTGCTTAGCCGGGTCAACCGGCTGCAGCCGTTCG 1240 
ACGGCTACACCGACCCGGTTGCCAGCGAAAAGAAGTTGGT 1280 
GCGCAACGCTTTTCGAGATGGCGACTGTTGGTTCAACACC- 1320 
GGTGACGTGATGAGCCCGCAGGGCATGGGCCATGCCGCCT 1360 
TCGTCGATCGGCTGGGCGACACCTTCCGCTGGAAGGGGGA • 1 400 

1410 1420/ . 1430. ' 1440 

1 1 ' ■ * * ' * ' I t t . . I i I t I \ \ i I i t t I r I \ I .i.i r ( t t i ■ I 

GAATGTCGCCACCACTCAGGTCGAAGCGGCACTGGCC TCC 1440 
GACCAGACCGTCGAGGAGTGCACGGTCTACGGCGTCCAGA- 1480 
TTCCGCGCACCGGCGGGCGCGCCGGAATGGCCGCGATCAC' 1520 
ACTGCGCGCTGGCGCCGAATTCGACGGCCAGGCGCTGGCIC 1 560 
CGAACGGTTTACGGTCACTTGCCCGGCTAf GCACTTCCGC 1600 

1610 1620 - 1630 1640 

■ ■i.li ) t it 1 ... I i i i > l * t_ttt.it i ! t i til i i i i t 

TCTTTGTTCGGGTAGTGGGGTCGCTGGCGCACACCACGAC '1640 
GTTCAAGAGTCGCAAGGTGGAGTTGCGCAACCAGGCCTAT..1680 
GGCGCCGACATCGAGGATCCGCTGTAGGTAGTGGGCGGCC 1720 
CGGACGAAGGATATGTGCCGTACTACGCCGAATACCCTGA 1760 
GGAGGTTTCGCTCGGAAGGCGACCGCAGGGCTAG 1794 



WO 99/36537 



PCT/US99/00182 



111/1X7 

mtFATP coding only .protein 



.10 20 30 40 
, 1 1 1 1 t i i 1 i i i i l t i i i t i i i i ( i i,i i 1. 1 i ) i 1 i i t 1 1 

MSDYYGGAHTTVRL I DLATRMPRVLADTPV I VRGAMTGLL 40 
ARPNSKAS I GTVFQDRAARYGDRVFLKFGOQQLTYROANA 80 
TANRYAAVLAARGVGPGDVVGIMLRNSPSTVLAMLATVKC 120 
GA I AGMLNYHQRGEVLAHSLGLLOAKVL I AESDLVSAVAE 160 
CGASRGRVAGDVLTVEDVERFATTAPATNPASASAVQAKO 200 

210 220 ; 230 240 

*i ■ « . » I i ■ t i I ■ * i » I ■ ♦ * t I . i t > L i i t » I « t < < 1 iittl 

TAFY IFTSGTTGFPKASVMTHHRWLRALAVFGGMGLRLKG 240 
SOTLYSCLPLYHNNALTVAVSSV I NSGATLALGKSFSASR 280 
FWDEV I ANRATAFV Y I GE I CRYLLNGPAKPTDRAHQVRV I 320 
CGNGLRPEIWDEFTTRFGVARVCEFYAASEGNSAF1 NI'FN 360 
VPRTAGVSPMPLAFVEYDLDTGDPLRDASGRVRRVPDGEP 400' 

410 420 430 440 

' * ' » 1 * » * ■ » 1 * * * * 1 * * f ■ 1 i i t t i § i i i 1 i i i i i t i < t i ' 
GLLLSRVNRLQPFDGYTDPVASEKKLVRNAFRDGDCWFNT 440 
GDVMSPQGMGHAAFVDRLGDTFRWKGENVATTQVEAALAS 480 
DQT VEECtVYGVQ I PRTGGRAGMAAI TLRAGAEFDGQALA 520 
RTVYGHLPGYALPLFVRVVGSLAHTTTFKSRKVELRNQAY 560 
GAD I EDPLYVLAGPDEGYVPYYAEYPEEVSLGRRPGG. 598 



fv,. Sf 
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116/117 



haPATP3 

1 cga ccc acg cgt ccg ggg atg etc gcg age ggc tgg aac cag acg gtg ccg ata gag gaa 
! MFASGWNQTVPIEE 

61 gcg ggc tec atg get gec etc ctg ctg ctg ccc ctg ctg ctg ttg eta ccg ctg ctg ctg 

15 AGSMAALLLLPLLLLLPLLL 

121 ctg ctg aag eta cac etc tgg ccg cag ttg cgc tgg ctt ccg gcg gac ttg gec ttt gcg 

35 LLKLHLWPQLRWLPADLAFA 

1B1 gtg cga get ctg tgc tgc aaa agg get ctt cga get cgc gee ctg gec gcg get gec gec 

55 VRALCCKRALRARALAAAAA 

241 gac ccg gaa ggt ccc gag ggg ggc tgc age ctg gec tgg cgc etc gcg gaa ctg gec cag 

75 DPEGPEGGCSLAWRL.AELAQ 

301 cag cgc gec gcg cac acc ttt etc att cac ggc teg egg cgc ttt age tac tea gag gcg 

95 Q R A A H T F.I* IHGSRRFSYSEA 

361 aag cgc gag age aac agg get gca cgc gec ttc eta cgn gcg eta ggc tgg gac tgg gga 

115 SRESNRAARAFLRALGWDWG 

421 ccc gac ggc ggc gac age ggc gag ggg age get gga gaa ggc gag egg gca gcg ccg gga 

135 pdggdsgegsa'gegeraapg 

481 gec gga gat gca gcg gee gga age ggc gcg gag ttt gec gga ggg gac ggc gec gec aga 

155 AGDAAAG S GAEFAGGDGAAR 

541 ggt gga gga aag ccc gec gec ccc ctg tea cct gga gca act gtg gcg ctg etc etc ccc 

175 GGGEPAAPIiSPGATVALLLP 

601 get ggc cca gag ttt ctg egg etc tgg ttc ggg ctg gec aag gec ggc ccg cgc act gec 

195 AGPEFOWLWrGLAKAGLRTA 

661 ttt gtg ccc acc gec ctg cgc egg ggc ccc ctg ctg cac tgc etc cgc age cgc ggc gcg 

215 FVPTALRR GFLLHCLRSCGA 

721 cgc gcg c-g gcg ccg gcg cca gag ttt ccg gag ccc ctg gag ccg gac ctg ccc gec ctg 

235 RALVLAPEFLESLEPDLPAL 

781 aga gec atg ggg etc cac ctg tgg get gca ggc cca gga acc cac cct get gga att age 

255 RAMGLH LWAAG PGTH PAG I S 

841 gat ctg ctg get gaa gcg tec gen gaa gcg gat ggg cca gtg cca gga tac etc tct tec 

275 DLLAEVSAEVDGPVPGYLSS 

901 ccc cag age ata aca gac acg tgc ctg tac ate etc acc tct ggc acc acg ggc etc ccc 

295 PQSITDTC L Y I FTSGTTGLP 

961 aag get get egg ate agt cat ctg aag ate ctg caa tgc cag ggc ttc tat cag ccg tgt 

315 KAARISHLKILQCQGFYQLC 

1021 ggt gtc cac cag gaa gat gtg ate cac etc gee etc cca etc tac cac atg tec ggt tec 
335 GVHQEDV I YLAL PLYHMSGS 

10B1 ctg ctg ggc ate gtg ggc tgc atg ggc att ggg gee aca gtg gtg ctg aaa tec aag ttc 
355 LLGIVGCMGIGATVVLKSKF 

1141 teg get ggt cag ttc tgg gaa gat tgc cag cag cac agg gtg acg gcg etc cag tac att 
375 SAGQFWEDCQQHRVTVFQYI 

1201 ggg gag ctg tgc cga tac ctt gee aac cag ccc ccg age aag gca gaa cgt ggc cat aag 
395 GELCRYLVNQPPSKAERGHK 
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gcc egg ctg gca gcg ggc age ggg ctg cgc cca gac acc tgg gag cgc cct gcg egg cgc 
VRLAVGSG LRPDTWERFVRR 

tec ggg ccc ctg cag gtg ctg gag aca tat gga ctg aca gag ggc aac gtg gcc acc ate 
F G P LQVL E T Y G LT EGNVAT I 

aac tac aca gga cag egg ggc gee gtg ggg cge gee tec tgg ctt cac aag cat ate ttc 
NYTQQRGAVGRASWLYKHI F 

ccc ttc tec ttg att cgc tat gac gtc acc aca gga gag cca att egg gac ccc cag ggg 
PFSLIRYDVT TGE'PIRDPQG 

cac tge atg gcc aca tct cca ggt gag cca ggg ctg ccg gtg gcc ccg gca age cag cag 
HCMATS PGEPGLLVAP .V S QQ 

ccc cca ttc ccg ggc tat get ggc ggg cca gag ctg gcc cag ggg aag ttg eta aag gat 
S PFLGYAGGPELAQGKLLKD 

gtc etc egg cct ggg gat get ttc tec aac ace ggg gac ccg ccg gtc tgc gat gac caa 

V F R PGDV ? FNTGD L L V C D D Q 

ggc tec etc cgc ttc cat gat cgt act gga gac acc ttc agg tgg aag ggg gag aac gtg 
GFLRFHDRTGDTFRWKGENV 

gcc aca acc gag gtg gca gag gtc tec gag gcc eta gat ctt ccc cag gag gtg aac gtc 
ATTEVAEVFEALDFLQEVNV 

cat gga gcc acc gtg cca ggg cat gaa ggc agg get gga atg gca gcc eta gtt ctg cgt 
YGVT VPGHEGRAGMAALVLR 

ccc ccc cac gcc teg gac cct atg cag etc tac acc cac gtg tct gag aac ttg cca cct 
P PHALDLMQLYTHVSENLPP 

tat gcc egg ccc cga ttc etc agg etc cag gag tct ttg gcc acc aca gag acc tec aaa 

Y ARPRFLRLQESLATTETFK 

cag cag aaa gtt egg atg gca aat gag ggc ttc gac ccc age acc ctg cct gac cca ctg 
QQKVRMANEGFDPSTLSDPL 

tac gtt ctg gac cag get gca ggt gcc tac ctg ccc etc aca act gcc egg tac age gcc 
YVLDQAVGAYLPLTT ARYSA 

etc ctg gca gga aac ctt cga ate tga gaa ctt cca cac ctg agg cac ctg aga gag gaa 
LLAGNLRI* 




